-
bilingual-subword Public
Bilingual Subword Segmentation for Neural Machine Translation (Deguchi et al., 2020)
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Jul 28, 2024 -
sacremoses Public
Forked from hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Python MIT License Updated May 26, 2024 -
-
dbsa Public
Dependency-Based Self-Attention for Transformer NMT
-
-
math_writing Public
Forked from mti-lab/math_writing
Introduction to mathematical writing for undergraduate students majoring in engineering
TeX MIT License Updated Apr 19, 2023 -
tensor2tensor Public
Forked from tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Python Apache License 2.0 Updated Feb 17, 2023 -
-
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License Updated Jul 13, 2022 -
mtdata Public
Forked from thammegowda/mtdata
A tool that locates, downloads, and extracts machine translation corpora
Python Apache License 2.0 Updated Jun 26, 2022 -
pyrallis Public
Forked from eladrich/pyrallis
Pyrallis is a framework for structured configuration parsing from both cmd and files. Simply define your desired configuration structure as a dataclass and let pyrallis do the rest!
Python MIT License Updated Jun 10, 2022 -
-
nanopq Public
Forked from matsui528/nanopq
Pure python implementation of product quantization for nearest neighbor search
Python MIT License Updated Apr 1, 2022 -
rii Public
Forked from matsui528/rii
Fast and memory-efficient ANN with a subset-search functionality
Python MIT License Updated Sep 12, 2021 -
git-rsync Public
Synchronize the git repository via rsync.
-
gpustat Public
Forked from wookayin/gpustat
📊 A simple command-line utility for querying and monitoring GPU status
Python MIT License Updated Mar 23, 2021 -
-
stanza Public
Forked from stanfordnlp/stanza
Official Stanford NLP Python Library for Many Human Languages
Python Other Updated Jan 25, 2021 -
sentencepiece Public
Forked from google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 Updated Jan 4, 2021 -
-
kytea Public
Forked from neubig/kytea
The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc.
C++ Other Updated Jan 29, 2020