What naughty word list is good to fight spam? -
i have simple spam filter mechanism in place uses list of naughty words spam (i use these post content user profile fields etc).
i have:
array ('shop','bags','shoes','shag','watches','sales','health','insurance','trader','wedding','casino','hack','ps3','cheap','episode','accessories','movie','nobod.info')
what lists using? i'm adding/changing words time, seeing other lists big help!
blacklisting words not work effectively. machine learning techniques useful here. is, @ messages marked explicitly spam, , let computer learn spam messages like. mail sites gmail use fight against spam. lot of work, reliable way fight spam (when last time saw spam in gmail inbox?)
blacklisting words these have high false positive rate, annoying.
Comments
Post a Comment