This sounds like an interesting project. Really, though, whether one source or another is "credible" isn't a stats question but a subject-domain question.
Out of curiosity, if you're able to say, is this for work, a website, school, etc.? I'm interested in web scraping too, which is why I ask. I'm aware of some R functionality that exists for it.