Search results

6 packages found

Get n-grams from text

published version 2.0.2, 3 years ago23 dependents licensed under $MIT
473,130

Get n-grams from text

published version 1.1.2, 5 years ago1 dependents licensed under $MIT
51,226

Word prediction for T9 keyboard.

published version 3.1.3, 4 years ago0 dependents licensed under $MIT
41

The 1/3 million most frequent words, all lowercase, with counts.

published version 1.1.1, 4 years ago0 dependents licensed under $MIT
13

Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization

published version 0.10.1, 6 months ago0 dependents licensed under $BSD-2-Clause
12

Feature hashing, also known as the hashing trick, a fast and space-efficient way of vectorizing features.

published version 1.0.0, 10 years ago0 dependents licensed under $MIT
3