San Kim

0 Followers

deep learning language modeling machine learning seminar #natural language processing bert nlp paper multi-class classification xlnet transformer-xl ai2 gan pytorch classification reverse kl divergence probability parameter regularization parameter of distribution image-to-image transformation face recognition face verification transformer adaptive softmax input representations position embeddings relative position embeddings back propagation learning rate local gradient gradient update optimization neural network full connected layer bayes's theorem baysian inference cross entropy curse of dimensionality entropy exponential family forward kl divergence information theory jensen-shannon divergence kullback-leibler divergence logistic sigmoid map maximum entropy distribution mle mode collapsing fever dense retrieval temporal reasoning temporal dataset implicit temporal events ai language language models dimensionality fine-tuning code generation alphacode acl text generation encoder decoder model long context conference emnlp fine tuning fast adaptation efficient fine-tuning llms roberta réformer pretrained mdoel electra replaced token detection pretrained model commonsense reasoning abductive commonsense reasoning nli nlg abductive dataset iclr iclr 2020 gpt #zero-shot learning #multi task #gpt3 #unified question answering multi-hop qa hotpotqa

Activity
About

San Kim

Presentations

Back propagation

Deep learning study 1

Deep learning study 2

Deep learning study 3

Gan seminar

Face recognition v1

Transformer xl

XLnet RoBERTa Reformer

Electra

Abductive commonsense reasoning

Measuring massive multitask language understanding

Answering complex open domain questions with multi-hop dense retrieval

Temporal reasoning task

AI2 day.pptx

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning.pptx

Compeition-Level Code Generation with AlphaCode.pptx

slide-acl2022-combined_san.pptx

LongT5_Efficient Text-toText Transformer for Long Sequences_san.pptx

2023 EMNLP day_san.pptx

20230419-LLaMA-Adapter_ Efficient Fine-tuning of Language Models with Zero-init Attention_san.pptx

Likes

Dependency Parser, 의존 구조 분석기

안.전.제.일. 강화학습!

Accelerating TensorFlow with RDMA for high-performance deep learning

Bayesian Inference : Kalman filter 에서 Optimization 까지 - 김홍배 박사님

Deep learning - A Visual Introduction