Alright, I'm trying to develop this package for quantitative discourse analysis. Anyway, I decided I need to do syllable counts (my ultimate goal) and found out this is pretty darn complex (SO LINK). So I read a bunch of articles and found out entire dissertations can be devoted to this. I'm not interested in reinventing the wheel, so I looked at what's been done already that's good, and I found a site, http://www.syllablecount.com, that is pretty darn good, maybe even the best. I also know LaTeX uses an algorithm to split words into syllables. (This can be useful for readability statistics and such.)
So this is what I've got so far:
1) a website that does syllabication pretty well
2) LaTeX does syllabication pretty well too
I want to use either one to create an R function that will do syllable counts for a vector of words. So for instance:
Code:
x <- c('dog', 'cat', 'pony', 'cracker', 'shoe', 'Popsicle', 'pronunciation')
would yield: 1, 1, 2, 2, 1, 3, 5
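To make the target concrete, here's the kind of interface I'm after, filled in with a rough baseline I can already write myself: a naive vowel-run counter. This is only a sketch under my own assumptions. The name naive_syllables and the crude silent-e rule are made up for illustration, and this is not the algorithm the website or LaTeX uses.

```r
# Hypothetical baseline: count runs of vowels, with a crude silent-e adjustment.
# Not the website's or LaTeX's method; just a sketch of the desired interface.
naive_syllables <- function(words) {
  w <- tolower(words)
  # Number of maximal vowel runs per word (treating 'y' as a vowel).
  runs <- lengths(regmatches(w, gregexpr("[aeiouy]+", w)))
  # Subtract one for a trailing consonant + 'e' (e.g. 'make'),
  # but not for '-le' endings (e.g. 'Popsicle').
  silent_e <- grepl("[^aeiouy]e$", w) & !grepl("le$", w)
  # Every word gets at least one syllable.
  pmax(1L, runs - silent_e)
}

x <- c('dog', 'cat', 'pony', 'cracker', 'shoe', 'Popsicle', 'pronunciation')
naive_syllables(x)
```

On this vector it gets everything right except 'pronunciation', which it counts as 4 instead of 5 because 'ia' and 'io' are each treated as a single vowel run. That's exactly the kind of miss I'm hoping a better method avoids.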
The web-based one is more accurate than the conservative LaTeX solution. I'm open to any solution that gets the job done reasonably well (95% accuracy) and fairly fast. Ideally the solution would not be web-based, but if I don't want to reinvent the wheel and still want to be accurate, I may have to go that route.
What I need help with is making this work. Questions I have:
1) Is it possible to use a web application like this through R?
2) Is it possible to harness the algorithm of LaTeX (basically call LaTeX to compute the syllable counts) with R?
3) Is either legal?
I'm used to using R at a pretty basic level, so I don't even know what questions to ask here. Maybe there's a simple answer I'm missing.