Jump to content

GitHub user reverse engineered Google's new “captchaless” ReCaptcha system

Original article:

 

https://github.com/ReCaptchaReverser/InsideReCaptcha

 

Technical Highlights:

  • To obfuscate the code, Google created a new VM based language run inside Javascript
  • The code run in this VM is encrypted
  • The encrypted code can dynamically change it's own encryption key at runtime
  • The VM code can also dynamically change what VM instructions do​

What data Google collects:

  • Browser Plug-Ins
  • User-Agent
  • Screen resolution
  • Timestamp
  • Time zone
  • Cookies
  • Number of click/keyboard/touch actions
  • Known browser-specific CSS quirks
  • Tests HTML5 canvas support

 

EDIT1: Remove duplicate item

[22:56] <Rehclip> Goddamnit hes right? Or goddamnit you're still on the computer chatting on IRC while your house is ablaze?

Link to comment
Share on other sites

Link to post
Share on other sites

lol gg google


CPU: Intel i5 4570 | Cooler: Cooler Master TPC 812 | Motherboard: ASUS H87M-PRO | RAM: G.Skill 16GB (4x4GB) @ 1600MHZ | Storage: OCZ ARC 100 480GB, WD Caviar Black 2TB, Caviar Blue 1TB | GPU: Gigabyte GTX 970 | ODD: ASUS BC-12D2HT BR Reader | PSU: Cooler Master V650 | Display: LG IPS234 | Keyboard: Logitech G710+ | Mouse: Logitech G602 | Audio: Logitech Z506 & Audio Technica M50X | My machine: https://nz.pcpartpicker.com/b/JoJ

Link to comment
Share on other sites

Link to post
Share on other sites

Just queueing you in, screen res is listed twice.

Main Rig: CPU: AMD Ryzen 7 5800X | RAM: 32GB (2x16GB) KLEVV CRAS XR RGB DDR4-3600 | Motherboard: Gigabyte B550I AORUS PRO AX | Storage: 512GB SKHynix PC401, 1TB Samsung 970 EVO Plus, 2x Micron 1100 256GB SATA SSDs | GPU: EVGA RTX 3080 FTW3 Ultra 10GB | Cooling: ThermalTake Floe 280mm w/ be quiet! Pure Wings 3 | Case: Sliger SM580 (Black) | PSU: Lian Li SP 850W

 

Server: CPU: AMD Ryzen 3 3100 | RAM: 32GB (2x16GB) Crucial DDR4 Pro | Motherboard: ASUS PRIME B550-PLUS AC-HES | Storage: 128GB Samsung PM961, 4TB Seagate IronWolf | GPU: AMD FirePro WX 3100 | Cooling: EK-AIO Elite 360 D-RGB | Case: Corsair 5000D Airflow (White) | PSU: Seasonic Focus GM-850

 

Miscellaneous: Dell Optiplex 7060 Micro (i5-8500T/16GB/512GB), Lenovo ThinkCentre M715q Tiny (R5 2400GE/16GB/256GB), Dell Optiplex 7040 SFF (i5-6400/8GB/128GB)

Link to comment
Share on other sites

Link to post
Share on other sites

so does this mean this guy just gave a cheat sheet for bots to pass this captcha and make it useless?

this is one of the greatest thing that has happened to me recently, and it happened on this forum, those involved have my eternal gratitude http://linustechtips.com/main/topic/198850-update-alex-got-his-moto-g2-lets-get-a-moto-g-for-alexgoeshigh-unofficial/ :')

i use to have the second best link in the world here, but it died ;_; its a 404 now but it will always be here

 

Link to comment
Share on other sites

Link to post
Share on other sites

so does this mean this guy just gave a cheat sheet for bots to pass this captcha and make it useless?

Since the verification is done server-side, we may never know the exact requirements to pass the test.

All this has done is discover what data is collected on the client side, and how it is sent back to Google.

[22:56] <Rehclip> Goddamnit hes right? Or goddamnit you're still on the computer chatting on IRC while your house is ablaze?

Link to comment
Share on other sites

Link to post
Share on other sites

Pretty neat. As far as I read in the github Summary, he hasn't completely figured how it completely works, but has the idea. Might be a matter of time before someone is able to bypass it by sending a bytecode.

Link to comment
Share on other sites

Link to post
Share on other sites

so does this mean this guy just gave a cheat sheet for bots to pass this captcha and make it useless?

 

 

Since the verification is done server-side, we may never know the exact requirements to pass the test.

All this has done is discover what data is collected on the client side, and how it is sent back to Google.

 

even if a cheat sheet was released im sure the algorithms would change so quickly it would be useless to try and keep the bot uptodate. just like trying to get your site listed higher in google search results... the algorithms change so rapidly that anything you do to try and exploit them wont work the next day

Case: NZXT Phantom PSU: EVGA G2 650w Motherboard: Asus Z97-Pro (Wifi-AC) CPU: 4690K @4.2ghz/1.2V Cooler: Noctua NH-D15 Ram: Kingston HyperX FURY 16GB 1866mhz GPU: Gigabyte G1 GTX970 Storage: (2x) WD Caviar Blue 1TB, Crucial MX100 256GB SSD, Samsung 840 SSD Wifi: TP Link WDN4800

 

Donkeys are love, Donkeys are life.                    "No answer means no problem!" - Luke 2015

 

Link to comment
Share on other sites

Link to post
Share on other sites

Even though Google is probably trying to clone America with the info they collect, they're still better than most other companies in similar spaces.

Link to comment
Share on other sites

Link to post
Share on other sites

  • Browser Plug-Ins
  • User-Agent
  • Screen resolution
  • Timestamp
  • Time zone
  • Cookies

 

hmm this is worrying many privacy based OSes that come with TOR have these settings exactly the same

example would be Whonix - i wonder if it would be called a bot

If your grave doesn't say "rest in peace" on it You are automatically drafted into the skeleton war.

Link to comment
Share on other sites

Link to post
Share on other sites

hmm this is worrying many privacy based OSes that come with TOR have these settings exactly the same

example would be Whonix - i wonder if it would be called a bot

If it thinks you're a bot, it just asks you to complete a traditional warped-test captcha.

[22:56] <Rehclip> Goddamnit hes right? Or goddamnit you're still on the computer chatting on IRC while your house is ablaze?

Link to comment
Share on other sites

Link to post
Share on other sites

If it thinks you're a bot, it just asks you to complete a traditional warped-test captcha.

i thought it requires an email ?

If your grave doesn't say "rest in peace" on it You are automatically drafted into the skeleton war.

Link to comment
Share on other sites

Link to post
Share on other sites

Well that's one way Google can find out what they need to do to make their services better I guess. Although I'd much rather a voluntary survey than a forced one...

Link to comment
Share on other sites

Link to post
Share on other sites

Since the verification is done server-side, we may never know the exact requirements to pass the test.

All this has done is discover what data is collected on the client side, and how it is sent back to Google.

Which means you can extrapolate what the tests are and eventually crack the system. Google really screwed the pooch on this one.

Software Engineer for Suncorp (Australia), Computer Tech Enthusiast, Miami University Graduate, Nerd

Link to comment
Share on other sites

Link to post
Share on other sites

i thought it requires an email ?

Nope. If you fool it into thinking your a bot , its a simple captcha.

And fooling it isn't even difficult.

A riddle wrapped in an enigma , shot to the moon and made in China

Link to comment
Share on other sites

Link to post
Share on other sites

Which means you can extrapolate what the tests are and eventually crack the system. Google really screwed the pooch on this one.

Well mabye its a server thing to stop bots. So if suddenly a whole bunch of requests come in systymatically , Then it is very strict on the human testing.

But really, there is no way to make a fool proof captcha since bots are becoming better at being human then humans.

I mean they already read Captchas better than humans.

A riddle wrapped in an enigma , shot to the moon and made in China

Link to comment
Share on other sites

Link to post
Share on other sites

Even though Google is probably trying to clone America with the info they collect, they're still better than most other companies in similar spaces.

 

I disagree, I think Google is just as bad as anyone else. They're a marketing/ad company first and I'd trust companies like Microsoft more with personal data.

Link to comment
Share on other sites

Link to post
Share on other sites

I disagree, I think Google is just as bad as anyone else. They're a marketing/ad company first and I'd trust companies like Microsoft more with personal data.

I mistrust Microsoft for their incompetence and Google more for potential malice.

Link to comment
Share on other sites

Link to post
Share on other sites

I mistrust Microsoft for their incompetence and Google more for potential malice.

Found my senior yearbook quote.

[22:56] <Rehclip> Goddamnit hes right? Or goddamnit you're still on the computer chatting on IRC while your house is ablaze?

Link to comment
Share on other sites

Link to post
Share on other sites

i thought it requires an email ?

 

The email in the announcement was because the captcha was for a forum sign-up or something. You won't need an email address.

Intel i7 5820K (4.5 GHz) | MSI X99A MPower | 32 GB Kingston HyperX Fury 2666MHz | Asus RoG STRIX GTX 1080ti OC | Samsung 951 m.2 nVME 512GB | Crucial MX200 1000GB | Western Digital Caviar Black 2000GB | Noctua NH-D15 | Fractal Define R5 | Seasonic 860 Platinum | Logitech G910 | Sennheiser 599 | Blue Yeti | Logitech G502

 

Nikon D500 | Nikon 300mm f/4 PF  | Nikon 200-500 f/5.6 | Nikon 50mm f/1.8 | Tamron 70-210 f/4 VCII | Sigma 10-20 f/3.5 | Nikon 17-55 f/2.8 | Tamron 90mm F2.8 SP Di VC USD Macro | Neewer 750II

Link to comment
Share on other sites

Link to post
Share on other sites

If it thinks you're a bot, it just asks you to complete a traditional warped-test captcha.

One day they will turn off the traditional captchas because they don't work (that's the reason they gave for inventing the "captchless" system). Then you can choose between letting google spy on you and not using any service with "captchless" protection.

 

Also, github repo seems to be down. Looks like someone didn't like their secret spying machine being reversed.

 

edit: Yup, google doesn't like it!

The code you reversed is used to protect many sites’ registration process including Google and many others. We are concerned that having your code and analysis publicly available will make it easier to build registration automation tools which will result in a surge of spam in all the services protected by this code and will affect negatively many Internet users.

This is why we kindly ask you to temporarily remove it from GitHub so your work won’t be used for a malicious purpose which we believe was never your intended goal.

http://www.reddit.com/r/netsec/comments/2or9e3/reverseengineering_the_new_captchaless_recaptcha/cmqna04

 

Got to love Google. Fucktards. Captcha didn't work, Captcha doesn't work, Captcha will never work.

Link to comment
Share on other sites

Link to post
Share on other sites

Well... that list of data collected looks identical to (if not shorter than) the data that just about every page on the globe collect from users through the Analytics script. Fun stuff to know of nonetheless.

Cheers,

Linus

Link to comment
Share on other sites

Link to post
Share on other sites

Just as I thought; Google has to "learn" about you to figure out if you are a robot. This seems to put the users at an ultimatum, giving up precious data to the advertisers (and the likes of the NSA), or do not bother registering for anything potentially useful ever again. And with the security tools (rightfully) blocking it, it only adds to the problem.

 

And I was wondering why it was not showing up on my browsers...

Read the community standards; it's like a guide on how to not be a moron.

 

Gerdauf's Law: Each and every human being, without exception, is the direct carbon copy of the types of people that he/she bitterly opposes.

Remember, calling facts opinions does not ever make the facts opinions, no matter what nonsense you pull.

Link to comment
Share on other sites

Link to post
Share on other sites

Well... that list of data collected looks identical to (if not shorter than) the data that just about every page on the globe collect from users through the Analytics script. Fun stuff to know of nonetheless.

Which you can easily block. Try blocking the captcha if you want to use the service "protected" by the captcha.

 

Also, found a mirror (https://github.com/toogle/InsideReCaptcha) and pulled the code locally.

Link to comment
Share on other sites

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×