Search results
6 packages found
Sort by: Default
- Default
- Most downloaded this week
- Most downloaded this month
- Most dependents
- Recently published
Get n-grams from text
published version 2.0.2, 3 years ago23 dependents licensed under $MIT
473,130
Get n-grams from text
published version 1.1.2, 5 years ago1 dependents licensed under $MIT
51,226
Word prediction for T9 keyboard.
published version 3.1.3, 4 years ago0 dependents licensed under $MIT
41
The 1/3 million most frequent words, all lowercase, with counts.
published version 1.1.1, 4 years ago0 dependents licensed under $MIT
13
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
published version 0.10.1, 6 months ago0 dependents licensed under $BSD-2-Clause
12
Feature hashing, also known as the hashing trick, a fast and space-efficient way of vectorizing features.
- machine learning
- bag of words
- feature vector
- natural language processing
- nlp
- bow
- document classification
- information retrieval
- sparse vector
- ml
- classifier
- regression
- hash
- md5
- View more
published version 1.0.0, 10 years ago0 dependents licensed under $MIT
3