Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Detect bots and scripts #36

Open
tmaiaroto opened this issue Aug 8, 2014 · 0 comments
Open

Detect bots and scripts #36

tmaiaroto opened this issue Aug 8, 2014 · 0 comments
Milestone

Comments

@tmaiaroto
Copy link
Member

Twitter (mostly, but not only) is full of spam accounts managed by bots/scripts. We can detect these by a set of rules:

  • Look at the profile picture, if it's still the default...that's a strike against the account
  • Look at number of followers to following ratio (in the case of twitter)
  • Look at what is being posted (run your typical spam filter over the message - or maybe atypical, train naive bayesian classifier for this too)
  • Look at the name, I've seen things like "Bot" in the name for people announcing the fact that the account is automated (not always going to be the case, but does happen)

Then put another field on the message series (and probably shared links, if not all series) that flags possible robot. We'll keep the data, but it would be nice to be able to filter out suspicious accounts for greater accuracy.

@tmaiaroto tmaiaroto added this to the Backlog milestone Aug 8, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant