The reCaptcha Helper: A Machine Learning Study

نویسندگان

  • Guangjie Shi
  • Yuchen Ying
  • Yue Yin
چکیده

reCaptcha is a very popular CAPTCHA system on the Internet. Due to its design, we find that reCaptcha figures have certain feature that can potentially weaken this system. This paper discussed the design of reCaptcha, and use some simple Machine Learning algorithm to find a way to tell the difference between “control word” and “unknown word”. We also designed the experiment to evaluate our hypothesis and get a very good result. Keywords-CAPTCHA, reCaptcha, Machine Learning

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Breaking reCAPTCHA: A Holistic Approach via Shape Recognition

CAPTCHAs are small puzzles which should be easily solvable by human beings but hard to solve for computers. They build a security cornerstone of the modern Internet service landscape, deployed in essentially any kind of login service, allowing to distinguish authorized human beings from automated attacks. One of the most popular and successful systems today is reCAPTCHA. As many other systems, ...

متن کامل

Enabling Configuration-Independent Automation by Non-Expert Users

The Internet has allowed collaboration on an unprecedented scale. Wikipedia, Luis Von Ahn’s ESP game, and reCAPTCHA have proven that tasks typically performed by expensive in-house or outsourced teams can instead be delegated to the mass of Internet computer users. These success stories show the opportunity for crowd-sourcing other tasks, such as allowing computer users to help each other answe...

متن کامل

I’m not a human: Breaking the Google reCAPTCHA

Since their inception, captchas have been widely used for preventing fraudsters from performing illicit actions. Nevertheless, economic incentives have resulted in an arms race, where fraudsters develop automated solvers and, in turn, captcha services tweak their design to break the solvers. Recent work, however, presented a generic attack that can be applied to any text-based captcha scheme. F...

متن کامل

Geo-reCAPTCHA: Crowdsourcing large amounts of geographic information from earth observation data

The reCAPTCHA concept provides a large amount of valuable information for various applications. First, it provides security, e.g. for a form on a website, by means of a test that only a human could solve. Second, the effort of the user for this test is used to generate additional information, e.g. digitisation of books or identification of house numbers. In this work, we present a concept for a...

متن کامل

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013