HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Breaking reCAPTCHA: A Holistic Approach via Shape Recognition

Abstract : CAPTCHAs are small puzzles which should be easily solvable by human beings but hard to solve for computers. They build a security cornerstone of the modern Internet service landscape, deployed in essentially any kind of login service, allowing to distinguish authorized human beings from automated attacks. One of the most popular and successful systems today is reCAPTCHA. As many other systems, reCAPTCHA is based on distorted images of words, where the distortion system evolves over time and determines different generations of the system. In this work, we analyze three recent generations of reCAPTCHA and present an algorithm that is capable of solving at least 5% of the challenges generated by these versions.We achieve this by applying a specialized variant of shape contexts proposed by Belongie et al. to match entire words at once. In order to handle the ellipse shaped distortions employed in one of the generations, we propose a machine learning algorithm that virtually eliminates the distortion. Finally, an improved shape matching strategy allows us to use word dictionaries of a reasonable size (with approximately 20,000 entries).
Document type :
Conference papers
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download

Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Monday, July 24, 2017 - 10:40:12 AM
Last modification on : Monday, July 24, 2017 - 10:42:16 AM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Paul Baecher, Niklas Büscher, Marc Fischlin, Benjamin Milde. Breaking reCAPTCHA: A Holistic Approach via Shape Recognition. 26th International Information Security Conference (SEC), Jun 2011, Lucerne, Switzerland. pp.56-67, ⟨10.1007/978-3-642-21424-0_5⟩. ⟨hal-01567589⟩



Record views


Files downloads