Document classification
Wikipedia
Document classification (or document categorization) is a problem in information science. The task is to assign an electronic document to one or more categories based on its contents. Document classification tasks fall into two kinds: supervised document classification, where some external mechanism (such as human feedback) provides information on the correct classification of documents, and unsupervised document classification, where the classification must be done entirely without reference to external information. There is also semi-supervised document classification, where only some of the documents are labeled by the external mechanism.
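The contrast between the supervised and unsupervised settings can be made concrete with a small sketch. The Python below is an illustration only, not a method taken from the article: it assumes documents are compared by word overlap (Jaccard similarity), and the function names, sample texts, and the 0.2 threshold are all hypothetical choices. The supervised function relies on externally supplied labels, while the unsupervised one groups documents using nothing but their contents.

```python
def tokens(text):
    """Lowercase a document and split it into a set of words."""
    return set(text.lower().split())


def jaccard(a, b):
    """Word-set overlap, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def classify_supervised(document, labeled_examples):
    """Supervised: return the label of the most similar labeled example.

    labeled_examples is a list of (text, label) pairs; the labels are the
    external information (for instance, supplied by a human).
    """
    doc = tokens(document)
    _, best_label = max(
        labeled_examples, key=lambda ex: jaccard(doc, tokens(ex[0]))
    )
    return best_label


def group_unsupervised(documents, threshold=0.2):
    """Unsupervised: greedily group documents with no labels at all.

    A document joins the first group whose first member is similar enough;
    otherwise it starts a new group. The threshold is an arbitrary choice.
    """
    groups = []
    for text in documents:
        for group in groups:
            if jaccard(tokens(text), tokens(group[0])) >= threshold:
                group.append(text)
                break
        else:
            groups.append([text])
    return groups


labeled = [
    ("the team won the football match", "sports"),
    ("the stock market fell sharply today", "finance"),
]
print(classify_supervised("the football team lost the match", labeled))

unlabeled = [
    "football match tonight",
    "football match result",
    "stock market news",
    "stock market crash",
]
print(group_unsupervised(unlabeled))
```

In the supervised call the category names come from the labeled examples; in the unsupervised call the groups have no names, only members, which is why such output usually needs interpretation afterwards.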

Techniques

Document classification techniques include approaches based on natural language processing.

Applications

Classification techniques have been applied to spam filtering, a process that tries to distinguish spam email from legitimate email.
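As a rough illustration of spam filtering as a supervised classification task, the sketch below uses a naive Bayes classifier with add-one smoothing. Naive Bayes is chosen here purely as an example; the text above does not name a specific technique. The training messages, the "spam"/"ham" labels, and the function names are hypothetical, and a real filter would need far more data and better tokenization.

```python
import math
from collections import Counter


def train(messages):
    """Count words per label from a list of (text, label) pairs."""
    word_counts = {"spam": Counter(), "ham": Counter()}
    label_counts = Counter()
    for text, label in messages:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    return word_counts, label_counts


def classify(text, word_counts, label_counts):
    """Return the label with the highest naive Bayes log-probability."""
    vocab = set(word_counts["spam"]) | set(word_counts["ham"])
    total_messages = sum(label_counts.values())
    scores = {}
    for label, counts in word_counts.items():
        total_words = sum(counts.values())
        # Log prior: fraction of training messages carrying this label.
        score = math.log(label_counts[label] / total_messages)
        for word in text.lower().split():
            # Add-one smoothing so unseen words do not zero out the score.
            score += math.log((counts[word] + 1) / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical training data; "ham" denotes legitimate email.
training = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch with the team monday", "ham"),
]
word_counts, label_counts = train(training)

print(classify("claim your free money", word_counts, label_counts))
print(classify("agenda for the team meeting", word_counts, label_counts))
```

Working in log space avoids numerical underflow when many small word probabilities are multiplied together.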

