StupidFilter is "an open-source filter software that can detect rampant stupidity in written English. This will be accomplished with weighted Bayesian or similar analysis and some rules-based processing, similar to spam detection engines. ... To this end, we're collecting a ranked corpus of stupid text, gleaned from user comments on public websites and ranked on a five-point scale."
I wonder, to what extent can stupidity be modeled with a unigram distribution? What is the overall distribution of stupid comments? And how many random stupid comments does the average person look at before moving on?
No comments:
Post a Comment