Jump to content

Hey guys. Part of the program that I'm making involves retrieving google image results. Google deprecated their API which provided an easy way to do this. So as a result, I'm going to be using JSoup (I'm programming in Java) to parse the results. I've heard though that Google doesn't like to be scraped and will potentially ban your IP. I'm not a super experienced programmer and I know virtually nothing about scraping or web development or any of that kind of thing, but I was thinking that the easiest way to get around this is to change my IP every 10 searches or so. Is this a good strategy? I'd love if someone could give me some pointers on how to do this effectively. Thanks so much! 

Link to comment
https://linustechtips.com/topic/677756-google-image-scraping/
Share on other sites

Link to post
Share on other sites

From the google image API page about deprecation:

The Google Image Search API has been officially deprecated as of May 26, 2011. It will continue to work as per our deprecation policy, but the number of requests you may make per day may be limited. We encourage you to use the Custom Search API, which now supports image search.

I would recommend simply trying the custom search API, and if that doesn't work, come back and I'll see what I can do.

˙ǝɯᴉʇ ɹnoʎ ƃuᴉʇsɐʍ ǝɹɐ noʎ 'sᴉɥʇ pɐǝɹ oʇ ƃuᴉʎɹʇ ǝɹɐ noʎ ɟI

Link to comment
https://linustechtips.com/topic/677756-google-image-scraping/#findComment-8725297
Share on other sites

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×