| dc.contributor.author | 
Paijmans, H 
 | 
 | 
| dc.date.accessioned | 
2018-05-30T14:18:48Z | 
 | 
| dc.date.available | 
2018-05-30T14:18:48Z | 
 | 
| dc.date.issued | 
1995 | 
 | 
| dc.identifier.citation | 
Paijmans H (1995) Word frequencies in scientific prose. South African Computer Journal, Number 14, 1995 | 
en | 
| dc.identifier.issn | 
2313-7835 | 
 | 
| dc.identifier.uri | 
http://hdl.handle.net/10500/24177 | 
 | 
| dc.description.abstract | 
An overview is given of popular term-weighting schemes as used in Information Retrieval. The question is raised whether the word frequency based  t f.idf weights may be used as a yardstick to compare schemes that are not based on frequency-characteristics, such as the first mention theory or schemes based on word categories. It is demonstrated that word categories and frequency derived weights have a close correlation but that weighting according to the first mentioned theory shows no agreement with frequency-based weights. | 
en | 
| dc.language.iso | 
en | 
en | 
| dc.publisher | 
South African Computer Society (SAICSIT) | 
en | 
| dc.subject | 
Information retrieval | 
en | 
| dc.subject | 
Word frequencies | 
en | 
| dc.subject | 
SMART | 
en | 
| dc.subject | 
Full-text documents | 
en | 
| dc.title | 
Word frequencies in scientific prose | 
en | 
| dc.type | 
Article | 
en |