-
Train-32B-LLM Public
Model Parallelism, Full Parameter, One Machine Multi GPUs
Python MIT License UpdatedJan 4, 2026 -
-
BERT-pre-training Public
multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)
-
Train-8B-LLM Public
Data Parallelism, Full Parameter, One Machine Multi GPUs
-
-
Automatic Label Error Correction www.techrxiv.org/users/679328/articles/731085
-
-
-
-
-
-
NL2SQL-RULE Public
Content Enhanced BERT-based Text-to-SQL Generation https://arxiv.org/abs/1910.07179
-
code2text_LLM_text2code Public
code2text --> LLM --> text2code
Python MIT License UpdatedSep 20, 2024 -
code2image_LM_image2code Public
code2image + code2text --> image2text --> text2code
MIT License UpdatedSep 20, 2024 -
-
Semantic-Tree-Search Public
Revisiting Semantic Representation and Tree Search for Similar Question Retrieval https://arxiv.org/abs/1908.08326
Python UpdatedAug 29, 2023 -
TensorFlow-Study-From-Zero Public
TensorFlow等的Model的中文注释
-
run_movielens_dataset Public
Get AUC 0.794 at Movielens 20M dataset
-
run_criteo_dataset Public
Get AUC 0.809 at Criteo dataset by MLP
-
bert_compare Public
A Comprehensive Comparison of Pre-training Language Models https://arxiv.org/abs/2106.11483
Python UpdatedOct 28, 2022 -
text-style-transfer-chinese Public
金庸和古龙之间的文本风格转换
-
write my own neural network
-
Using Database Rule for Weak Supervised Text-to-SQL Generation https://arxiv.org/abs/1907.00620
-
table2answer Public
Table2answer: Read the database and answer without SQL https://arxiv.org/abs/1902.04260
-
-
开箱即用
-
Chinese-NER-InjectDictRule Public
named entity recognition combined with rule from entity dict
-
baidubaike_scrapy Public
爬取百度百科数据,用于BERT预训练
-
-



