GitHub - SKDDJ/cv-arxiv-daily: 🎓 Automatically Update CV Papers Daily Using GitHub Actions (Updates Every 12 Hours)

Updated on 2025.05.29

Table of Contents

PEFT
Text-to-Image Generation
Vision-Language Models
Generative Weight Space Modeling
Data Distillation
Schrodinger Bridge
Dataset Distillation
Synthetic Data Generation

PEFT

Publish Date	Title	Authors	PDF	Code
2025-05-26	GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yeonjoon Jung et.al.	2505.20355	null
2025-05-26	Parameter-Efficient Fine-Tuning with Column Space Projection	Junseo Hwang et.al.	2505.20211	null
2025-05-26	UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models	Xueyan Zhang et.al.	2505.20154	null
2025-05-25	Optimization-Inspired Few-Shot Adaptation for Large Language Models	Boyan Gao et.al.	2505.19107	null
2025-05-27	Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs	Jaemin Kim et.al.	2505.19075	link
2025-05-24	HD-PiSSA: High-Rank Distributed Orthogonal Adaptation	Yiding Wang et.al.	2505.18777	null
2025-05-24	AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping	Haonan Dong et.al.	2505.18738	null
2025-05-24	LLM-QFL: Distilling Large Language Model for Quantum Federated Learning	Dev Gurung et.al.	2505.18656	null
2025-05-24	Knowledge Grafting of Large Language Models	Guodong Du et.al.	2505.18502	null
2025-05-22	Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval	Hailong Ning et.al.	2505.16756	null
2025-05-28	Larger Is Not Always Better: Exploring Small Open-source Language Models in Logging Statement Generation	Renyi Zhong et.al.	2505.16590	null
2025-05-21	VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation	Niccolo Avogaro et.al.	2505.15592	null
2025-05-21	CoLA: Collaborative Low-Rank Adaptation	Yiyun Zhou et.al.	2505.15471	link
2025-05-21	Gated Integration of Low-Rank Adaptation for Continual Learning of Language Models	Yan-Shuo Liang et.al.	2505.15424	link
2025-05-21	Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification	Bernardin Ligan et.al.	2505.15334	null
2025-05-21	Few-Shot Adversarial Low-Rank Fine-Tuning of Vision-Language Models	Sajjad Ghiasvand et.al.	2505.15130	null
2025-05-21	Dual Decomposition of Weights and Singular Value Low Rank Adaptation	Jialong Han et.al.	2505.14367	null
2025-05-21	OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation	Jialong Han et.al.	2505.14350	null
2025-05-23	ABBA: Highly Expressive Hadamard Product Adaptation for Large Language Models	Raghav Singhal et.al.	2505.14238	link
2025-05-18	Adaptive parameter-efficient fine-tuning via Hessian-informed subset selection	Shiyun Xu et.al.	2505.12579	null
2025-05-18	Exploring Sparsity for Parameter Efficient Fine Tuning Using Wavelets	Ahmet Bilican et.al.	2505.12532	link
2025-05-18	SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization	Haodong Yang et.al.	2505.12433	null
2025-05-16	Memory-Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation	Fei Wu et.al.	2505.11235	null
2025-05-15	Multi-Token Prediction Needs Registers	Anastasios Gerontopoulos et.al.	2505.10518	link
2025-05-14	PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning	Zongqian Li et.al.	2505.09519	link
2025-05-13	Parameter-Efficient Fine-Tuning of Vision Foundation Model for Forest Floor Segmentation from UAV Imagery	Mohammad Wasil et.al.	2505.08932	link
2025-05-10	Efficient Telecom Specific LLM: TSLAM-Mini with QLoRA and Digital Twin Data	Vignesh Ethiraj et.al.	2505.07877	null
2025-05-11	DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models	Junhao Xia et.al.	2505.07057	null
2025-05-10	Enfoque Odychess: Un método dialéctico, constructivista y adaptativo para la enseñanza del ajedrez con inteligencias artificiales generativas	Ernesto Giralt Hernandez et.al.	2505.06652	null
2025-05-07	GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model	Zixiang Ai et.al.	2505.04119	link
2025-05-05	HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models	Zheng Lin et.al.	2505.02795	null
2025-05-05	Parameter-Efficient Fine-Tuning with Attributed Patch Semantic Graph for Automated Patch Correctness Assessment	Zhenyu Yang et.al.	2505.02629	link
2025-05-01	AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care	Md Asaduzzaman Jabin et.al.	2505.00275	link
2025-04-30	Enhancing Health Mention Classification Performance: A Study on Advancements in Parameter Efficient Tuning	Reem Abdel-Salam et.al.	2504.21685	null
2025-05-09	A Systematic Literature Review of Parameter-Efficient Fine-Tuning for Large Code Models	Md Zahidul Haque et.al.	2504.21569	link
2025-04-29	TT-LoRA MoE: Unifying Parameter-Efficient Fine-Tuning and Sparse Mixture-of-Experts	Pradip Kunwar et.al.	2504.21190	null
2025-04-29	A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning	Jieming Bian et.al.	2504.21099	null
2025-04-29	ReCIT: Reconstructing Full Private Data from Gradient in Parameter-Efficient Fine-Tuning of Large Language Models	Jin Xie et.al.	2504.20570	null
2025-04-23	Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging	Shi Jie Yu et.al.	2504.18580	null
2025-04-24	Fine-tune Smarter, Not Harder: Parameter-Efficient Fine-Tuning for Geospatial Foundation Models	Francesc Marti-Escofet et.al.	2504.17397	null
2025-04-22	PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning	Song Wang et.al.	2504.16023	null
2025-04-21	What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale	Xiaoyong Yuan et.al.	2504.14815	null
2025-04-20	Harnessing Generative LLMs for Enhanced Financial Event Entity Extraction Performance	Soo-joon Choi et.al.	2504.14633	null
2025-04-20	Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation	Guoyi Zhang et.al.	2504.14481	null
2025-04-19	PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models	Nusrat Jahan Prottasha et.al.	2504.14117	null
2025-04-18	Parameter-Efficient Continual Fine-Tuning: A Survey	Eric Nuertey Coleman et.al.	2504.13822	null
2025-04-17	All-in-One Transferring Image Compression from Human Perception to Multi-Machine Perception	Jiancheng Zhao et.al.	2504.12997	null
2025-04-15	A Decade of Wheat Mapping for Lebanon	Hasan Wehbi et.al.	2504.11366	null
2025-04-14	CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation	Junchen Fu et.al.	2504.10307	link
2025-04-10	LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	Juzheng Zhang et.al.	2504.07448	link
2025-04-14	DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning	Songze Li et.al.	2504.06521	null
2025-04-16	Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation	Xiaoxing Hu et.al.	2504.06220	link
2025-04-11	AROMA: Autonomous Rank-one Matrix Adaptation	Hao Nan Sheng et.al.	2504.05343	link
2025-04-05	FISH-Tuning: Enhancing PEFT Methods with Fisher Information	Kang Xue et.al.	2504.04050	null
2025-04-02	CLIP-SLA: Parameter-Efficient CLIP Adaptation for Continuous Sign Language Recognition	Sarah Alyami et.al.	2504.01666	link
2025-04-01	Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations	Chongjie Si et.al.	2504.00851	null
2025-04-01	DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism	Dengchun Li et.al.	2504.00661	link
2025-03-31	ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning	Huandong Chang et.al.	2504.00254	null
2025-03-31	Order Matters: On Parameter-Efficient Image-to-Video Probing for Recognizing Nearly Symmetric Actions	Thinesh Thiyakesan Ponbagavathi et.al.	2503.24298	null
2025-03-29	Efficient Adaptation For Remote Sensing Visual Grounding	Hasan Moughnieh et.al.	2503.23083	null
2025-03-27	MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning	Jiancheng Zhao et.al.	2503.21838	link
2025-03-26	Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning	Sashuai Zhou et.al.	2503.20633	null
2025-03-26	IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting	Hao Fu et.al.	2503.20612	link
2025-03-26	Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection	Andrii Yermakov et.al.	2503.19683	link
2025-03-25	VectorFit: Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models	Suhas G Hegde et.al.	2503.19530	null
2025-03-24	MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning	Xu Han et.al.	2503.18368	link
2025-03-24	Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models	Zichen Miao et.al.	2503.18337	null
2025-03-23	Decoupling Angles and Strength in Low-rank Adaptation	Massimo Bini et.al.	2503.18225	link
2025-03-22	Visual Variational Autoencoder Prompt Tuning	Xi Xiao et.al.	2503.17650	null
2025-03-21	PE-CLIP: A Parameter-Efficient Fine-Tuning of Vision Language Models for Dynamic Facial Expression Recognition	Ibtissam Saadi et.al.	2503.16945	null
2025-03-20	VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis	Chia-Yi Hsu et.al.	2503.16195	null
2025-03-20	SALT: Singular Value Adaptation with Low-Rank Transformation	Abdelrahman Elsayed et.al.	2503.16055	link
2025-03-19	FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation	Yumin Zhang et.al.	2503.15390	null
2025-03-18	MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts	Runqi Meng et.al.	2503.14355	null
2025-03-15	A Survey on Federated Fine-tuning of Large Language Models	Yebo Wu et.al.	2503.12016	link
2025-03-14	Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages	Matteo Farina et.al.	2503.11609	link
2025-03-14	MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling	Rachel S. Y. Teo et.al.	2503.11144	link
2025-03-13	Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout	Shilong Wang et.al.	2503.10217	null
2025-03-13	Singular Value Fine-tuning for Few-Shot Class-Incremental Learning	Zhiwu Wang et.al.	2503.10214	null
2025-03-12	Revisiting semi-supervised learning in the era of foundation models	Ping Zhang et.al.	2503.09707	link
2025-03-11	1LoRA: Summation Compression for Very Low-Rank Adaptation	Alessio Quercia et.al.	2503.08333	null
2025-03-11	Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection	Ying Fu Lim et.al.	2503.08045	null
2025-03-09	MoFE: Mixture of Frozen Experts Architecture	Jean Seo et.al.	2503.06491	null
2025-03-08	Lifelong Learning with Task-Specific Adaptation: Addressing the Stability-Plasticity Dilemma	Ruiyu Wang et.al.	2503.06213	null
2025-03-07	Quantum-PEFT: Ultra parameter-efficient fine-tuning	Toshiaki Koike-Akino et.al.	2503.05431	null
2025-03-07	Personalized Text Generation with Contrastive Activation Steering	Jinghao Zhang et.al.	2503.05213	null
2025-03-06	TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models	Xinyi He et.al.	2503.04396	null
2025-03-05	State-offset Tuning: State-based Parameter-Efficient Fine-Tuning for State Space Models	Wonjun Kang et.al.	2503.03499	link
2025-03-11	PaCA: Partial Connection Adaptation for Efficient Fine-Tuning	Sunghyeon Woo et.al.	2503.01905	null
2025-03-03	Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace	Jia-Chen Zhang et.al.	2503.01419	null
2025-03-03	PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation	Linhai Zhang et.al.	2503.01303	null
2025-03-03	Beyond QA Pairs: Assessing Parameter-Efficient Fine-Tuning for Fact Embedding in LLMs	Shivam Ratnakar et.al.	2503.01131	null
2025-03-09	Re-Imagining Multimodal Instruction Tuning: A Representation View	Yiyang Liu et.al.	2503.00723	link
2025-02-27	MobiLLM: Enabling LLM Fine-Tuning on the Mobile Device via Server Assisted Side Tuning	Liang Li et.al.	2502.20421	null
2025-02-26	LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM	Yehonathan Refael et.al.	2502.19571	null
2025-02-22	ELBA-Bench: An Efficient Learning Backdoor Attacks Benchmark for Large Language Models	Xuxu Liu et.al.	2502.18511	null
2025-03-04	SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models	Yuxuan Zhang et.al.	2502.18168	null
2025-02-21	Sparsity May Be All You Need: Sparse Random Parameter Adaptation	Jesus Rios et.al.	2502.15975	link
2025-02-19	Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition	Xinyu Tian et.al.	2502.15809	null
2025-02-21	R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning	Jinda Liu et.al.	2502.15455	link
2025-02-20	Generative Modeling of Individual Behavior at Scale	Nabil Omi et.al.	2502.14998	null
2025-02-20	LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization	Yupeng Chang et.al.	2502.14538	link
2025-02-20	NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models	Chenlu Guo et.al.	2502.14482	link
2025-02-21	Token Adaptation via Side Graph Convolution for Efficient Fine-tuning of 3D Point Cloud Transformers	Takahiko Furuya et.al.	2502.14142	link
2025-02-19	LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation	Xin Li et.al.	2502.13568	null
2025-02-24	GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning	Sifan Zhou et.al.	2502.12913	null
2025-02-17	Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent	Junda Wu et.al.	2502.11740	null
2025-02-13	DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation	Tangyu Jiang et.al.	2502.08905	null
2025-02-12	LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits	Zikai Zhou et.al.	2502.08141	null
2025-02-12	Music for All: Exploring Multicultural Representations in Music Generation Models	Atharva Mehta et.al.	2502.07328	link
2025-02-10	Model Diffusion for Certifiable Few-shot Transfer Learning	Fady Rezk et.al.	2502.06970	null
2025-02-10	Hyper Compressed Fine-Tuning of Large Foundation Models with Quantum Inspired Adapters	Snehal Raj et.al.	2502.06916	null
2025-02-10	KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification	Yue Zhu et.al.	2502.06779	null
2025-02-10	FunduSAM: A Specialized Deep Learning Model for Enhanced Optic Disc and Cup Segmentation in Fundus Images	Jinchen Yu et.al.	2502.06220	null
2025-02-08	SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation	Yixian Shen et.al.	2502.05539	null
2025-02-07	SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model	Jiayang Yu et.al.	2502.04958	link
2025-02-05	FedP$^2$EFT: Federated Learning to Personalize Parameter Efficient Fine-Tuning for Multilingual LLMs	Royson Lee et.al.	2502.04387	null
2025-02-06	Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning	Peizhuang Cong et.al.	2502.03884	null
2025-02-05	Bilevel ZOFO: Bridging Parameter-Efficient and Zeroth-Order Techniques for Efficient LLM Fine-Tuning and Meta-Training	Reza Shirkavand et.al.	2502.03604	null
2025-02-05	RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts	Tuan Truong et.al.	2502.03044	null
2025-02-13	Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA	Shuangyi Chen et.al.	2502.01755	null
2025-02-03	Joint Localization and Activation Editing for Low-Resource Fine-Tuning	Wen Lai et.al.	2502.01179	link
2025-02-03	PARA: Parameter-Efficient Fine-tuning with Prompt Aware Representation Adjustment	Zequan Liu et.al.	2502.01033	null
2025-02-01	Parameter Efficient Fine-Tuning of Segment Anything Model	Carolin Teuber et.al.	2502.00418	link
2025-02-01	Sparse Gradient Compression for Fine-Tuning Large Language Models	David H. Yang et.al.	2502.00311	null
2025-01-30	Enhancing Large Language Model Efficiency via Symbolic Compression: A Formal Approach Towards Interpretability	Lumen AI et.al.	2501.18657	null
2025-01-23	Low-Rank Adapters Meet Neural Architecture Search for LLM Compression	J. Pablo Muñoz et.al.	2501.16372	link
2025-01-26	Fine Tuning without Catastrophic Forgetting via Selective Low Rank Adaptation	Reza Akbarian Bafghi et.al.	2501.15377	null
2025-02-09	Decentralized Low-Rank Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2501.15361	null
2025-01-25	Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification	Zhongqi Wang et.al.	2501.15040	null
2025-01-24	Domain Expansion: Parameter-Efficient Modules as Building Blocks for Composite Domains	Mann Patel et.al.	2501.14321	link
2025-01-23	Parameter-Efficient Fine-Tuning for Foundation Models	Dan Zhang et.al.	2501.13787	link
2025-01-21	EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition	Hamid Nasiri et.al.	2501.12067	link
2025-01-21	Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs	Saiful Haq et.al.	2501.11833	null
2025-01-17	OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning	Jinyuan Feng et.al.	2501.10062	null
2025-01-15	Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models	Zerui Tao et.al.	2501.08727	null
2025-01-14	TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yao Liang et.al.	2501.08008	null
2025-01-14	Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques	Shobhit Ratan et.al.	2501.07853	null
2025-01-12	A Hessian-informed hyperparameter optimization for differential learning rate	Shiyun Xu et.al.	2501.06954	null
2025-01-10	Aggregating Low Rank Adapters in Federated Fine-tuning	Evelyn Trautmann et.al.	2501.06332	null
2025-01-10	How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters	Romina Oji et.al.	2501.06025	link
2025-01-08	TADFormer: Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning	Seungmin Baek et.al.	2501.04293	null
2025-01-20	Spectral-Aware Low-Rank Adaptation for Speaker Verification	Zhe Li et.al.	2501.03829	link
2025-01-06	ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning	Pengwei Tang et.al.	2501.03291	link
2025-01-05	HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning	Saleh Ashkboos et.al.	2501.02625	link
2025-01-05	Efficient Deployment of Large Language Models on Resource-constrained Devices	Zhiwei Yao et.al.	2501.02438	null
2025-01-09	tCURLoRA: Tensor CUR Decomposition Based Low-Rank Parameter Adaptation and Its Application in Medical Image Segmentation	Guanghua He et.al.	2501.02227	null
2025-01-03	SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation	Mingjie Li et.al.	2501.01765	null
2025-01-07	Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption	Zhang Ruoyan et.al.	2501.01672	null
2024-12-30	Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment	Jianfei Zhang et.al.	2412.20834	link
2024-12-28	VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition	Lan Chen et.al.	2412.20064	link
2025-01-05	Gradient Weight-normalized Low-rank Projection for Efficient LLM Training	Jia-Hong Huang et.al.	2412.19616	link
2024-12-27	Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion	Koustav Ghosal et.al.	2412.19510	null
2024-12-24	Multi-Point Positional Insertion Tuning for Small Object Detection	Kanoko Goto et.al.	2412.18090	null
2024-12-23	Interweaving Memories of a Siamese Large Language Model	Xin Song et.al.	2412.17383	link
2024-12-26	LLMsAgainstHate @ NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs	Rushendra Sidibomma et.al.	2412.17131	link
2024-12-21	Label Privacy in Split Learning for Large Models with Parameter-Efficient Training	Philip Zmushko et.al.	2412.16669	link
2024-12-19	FedPIA -- Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning	Pramit Saha et.al.	2412.14424	null
2024-12-18	Parameter-efficient Fine-tuning for improved Convolutional Baseline for Brain Tumor Segmentation in Sub-Saharan Africa Adult Glioma Dataset	Bijay Adhikari et.al.	2412.14100	link
2024-12-18	A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection	Beiqi Zhang et.al.	2412.13801	link
2024-12-18	Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models	Xinxin Liu et.al.	2412.13488	null
2024-12-17	Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT	Jenny Kunz et.al.	2412.12674	link
2024-12-16	Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering	Jinhe Bi et.al.	2412.12359	link
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-11	Adaptive Principal Components Allocation with the $\ell_{2,g}$-regularized Gaussian Graphical Model for Efficient Fine-Tuning Large Models	Jingjing Zheng et.al.	2412.08592	link
2024-12-10	PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition	Kartik Narayan et.al.	2412.07771	null
2024-12-10	MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning	Yufei Ma et.al.	2412.07405	null
2024-12-13	Crack-EdgeSAM Self-Prompting Crack Segmentation System for Edge Devices	Yingchu Wang et.al.	2412.07205	null
2024-12-08	Taming Sensitive Weights: Noise Perturbation Fine-tuning for Robust LLM Quantization	Dongwei Wang et.al.	2412.06858	null
2024-12-09	BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation	Qiushi Wang et.al.	2412.06441	null
2024-12-19	S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity	Xinyu Yang et.al.	2412.06289	null
2024-12-08	KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models	Fan Wang et.al.	2412.06071	link
2024-12-07	Training-Free Bayesianization for Low-Rank Adapters of Large Language Models	Haizhou Shi et.al.	2412.05723	link
2024-12-06	PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning	Jonas Rieger et.al.	2412.04975	null
2024-12-04	Prompting Large Language Models for Clinical Temporal Relation Extraction	Jianping He et.al.	2412.04512	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	link
2024-12-04	Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning	Long Mai et.al.	2412.03343	link
2024-12-03	Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning	Zhaozhi Wang et.al.	2412.02759	null
2024-12-03	CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++?	Vaishnavi Bhargava et.al.	2412.02735	null
2024-12-03	LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization	Ethan Smith et.al.	2412.02352	null
2024-12-03	A Comprehensive Evaluation of Large Language Models on Aspect-Based Sentiment Analysis	Changzhi Zhou et.al.	2412.02279	null
2024-11-30	Unified Parameter-Efficient Unlearning for LLMs	Chenlu Ding et.al.	2412.00383	link
2024-11-29	SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks	Kim-Celine Kahl et.al.	2411.19688	link
2024-11-28	Parameter-Efficient Transfer Learning for Music Foundation Models	Yiwei Ding et.al.	2411.19371	link
2024-11-28	PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning	Shenghui Li et.al.	2411.19335	null
2024-11-28	Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation	Son Thai Ly et.al.	2411.19297	link
2024-11-27	Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning	Omkar Khade et.al.	2411.18571	null
2024-11-26	PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning	Zhen Sun et.al.	2411.17453	null
2024-11-29	Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning	Hui-Yue Yang et.al.	2411.17217	null
2024-11-25	Towards Efficient Model-Heterogeneity Federated Learning for Large Models	Ruofan Jia et.al.	2411.16796	null
2024-11-25	Parameter Efficient Instruction Tuning: An Empirical Study	Pengfei He et.al.	2411.16775	link
2024-11-25	Graph Adapter of EEG Foundation Models for Parameter Efficient Fine Tuning	Toyotaro Suzumura et.al.	2411.16155	null
2024-11-24	Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models	Olivia Ma et.al.	2411.15831	null
2024-11-21	Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation	Seokil Ham et.al.	2411.15224	null
2024-11-22	LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement	Jieming Bian et.al.	2411.14961	null
2024-11-21	Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model	Ege Kesim et.al.	2411.14064	null
2024-11-17	F$^3$OCUS -- Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics	Pramit Saha et.al.	2411.11912	null
2024-11-16	HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization	Huaqin Zhao et.al.	2411.10696	null
2024-11-12	PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model	Yilun Liu et.al.	2411.08212	null
2024-11-10	Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques	Daniil Sulimov et.al.	2411.06445	null
2024-11-06	MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba	Masakazu Yoshimura et.al.	2411.03855	link
2024-11-04	PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption	Yifan Tan et.al.	2411.03357	null
2024-11-05	Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation	Junchen Fu et.al.	2411.02992	null
2024-11-04	Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study	André Storhaug et.al.	2411.02462	null
2024-11-04	Expanding Sparse Tuning for Low Memory Usage	Shufan Shen et.al.	2411.01800	link
2024-11-15	Visual Fourier Prompt Tuning	Runjia Zeng et.al.	2411.01327	link
2024-10-31	CleaR: Towards Robust and Generalized Parameter-Efficient Fine-Tuning for Noisy Label Learning	Yeachan Kim et.al.	2411.00873	null
2024-10-30	FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems	Zihang Qiu et.al.	2411.00852	null
2024-11-01	Dual Low-Rank Adaptation for Continual Learning with Pre-Trained Models	Huancheng Chen et.al.	2411.00623	null
2024-11-01	Is Multiple Object Tracking a Matter of Specialization?	Gianluca Mancusi et.al.	2411.00553	null
2024-11-01	C2A: Client-Customized Adaptation for Parameter-Efficient Federated Learning	Yeachan Kim et.al.	2411.00311	link
2024-10-29	Preserving Pre-trained Representation Space: On Effectiveness of Prefix-tuning for Large Multi-modal Models	Donghoon Kim et.al.	2411.00029	null
2024-10-30	Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation	Wei Dong et.al.	2410.22952	null
2024-10-30	MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning	Xujia Wang et.al.	2410.22782	null
2024-10-29	Meta-Learning Adaptable Foundation Models	Jacob L. Block et.al.	2410.22264	null
2024-10-29	Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models	Raman Dutt et.al.	2410.22149	link
2024-10-30	IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models	Hang Guo et.al.	2410.21759	link
2024-10-28	KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation	Rambod Azimi et.al.	2410.20777	link
2024-10-27	Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation	Maohao Shen et.al.	2410.20336	null
2024-11-01	Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies	Luping Wang et.al.	2410.19878	null
2024-10-23	MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning	Jingfan Zhang et.al.	2410.18035	null
2024-10-22	Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations	Cheng Lei et.al.	2410.16953	null
2024-10-22	MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report	Samrajya Thapa et.al.	2410.16239	link
2024-10-21	Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning	Arijit Das et.al.	2410.16029	link
2024-10-18	Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation	Shuai Zhao et.al.	2410.14425	link
2024-10-17	LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning	Yiming Shi et.al.	2410.13618	link
2024-10-16	Communication-Efficient and Tensorized Federated Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2410.13097	null
2024-10-17	Prompt Compression for Large Language Models: A Survey	Zongqian Li et.al.	2410.12388	link
2024-10-15	Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models	Kai Yao et.al.	2410.11772	link
2024-10-15	LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models	Hossein Abdi et.al.	2410.11551	null
2024-10-15	RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates	Md Kowsher et.al.	2410.10075	link
2024-10-13	BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation	Peijia Qin et.al.	2410.09758	null
2024-10-12	Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks	Sungkyung Kim et.al.	2410.09489	link
2024-10-15	MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning	Yaming Yang et.al.	2410.09437	link
2024-10-09	Parameter-Efficient Fine-Tuning via Selective Discrete Cosine Transform	Yixian Shen et.al.	2410.09103	null
2024-10-04	BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models	Aofei Chang et.al.	2410.09079	null
2024-10-11	Parameter-Efficient Fine-Tuning of State Space Models	Kevin Galim et.al.	2410.09016	link
2024-10-10	Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning	Dingkang Liang et.al.	2410.08114	link
2024-10-10	SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture	Jiayi Han et.al.	2410.07739	null
2024-10-10	Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures	Yiming Chen et.al.	2410.07698	link
2024-10-09	SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers	Viktoriia Chekalina et.al.	2410.07383	link
2024-10-09	Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs	Ruijia Niu et.al.	2410.06431	null
2024-10-08	Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?	Shenbin Qian et.al.	2410.06338	link
2024-10-15	LoRTA: Low Rank Tensor Adaptation of Large Language Models	Ignacio Hounie et.al.	2410.04060	null
2024-10-03	Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection	Tianxiang Chen et.al.	2410.02330	link
2024-10-02	TPP-LLM: Modeling Temporal Point Processes by Efficiently Fine-Tuning Large Language Models	Zefang Liu et.al.	2410.02062	link
2024-10-02	NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models	Yibo Zhong et.al.	2410.01870	null
2024-09-27	A GEN AI Framework for Medical Note Generation	Hui Yi Leong et.al.	2410.01841	null
2024-10-02	DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models	Yuxuan Zhang et.al.	2410.01497	link
2024-10-01	PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models	Yang Li et.al.	2410.00433	null
2024-09-30	Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation	Pedro Henrique Paiola et.al.	2410.00163	null
2024-09-30	Resource Allocation for Stable LLM Training in Mobile Edge Computing	Chang Liu et.al.	2409.20247	null
2024-09-30	Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models	Luohe Shi et.al.	2409.20181	link
2024-09-28	FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models	Yucheng Xie et.al.	2409.19289	null
2024-10-01	Backdoor Attacks for LLMs with Weak-To-Strong Knowledge Distillation	Shuai Zhao et.al.	2409.17946	null
2024-09-26	PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification	Tianfang Xie et.al.	2409.17834	null
2024-09-30	Efficient In-Domain Question Answering for Resource-Constrained Environments	Isaac Chung et.al.	2409.17648	null
2024-10-07	PACE: marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization	Yao Ni et.al.	2409.17137	link
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-10-02	Bone: Block Affine Transformation as Parameter Efficient Fine-tuning Methods for Large Language Models	Jiale Kang et.al.	2409.15371	link
2024-09-22	Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape	Tao Li et.al.	2409.14396	null
2024-10-01	Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm	Jaehan Kim et.al.	2409.14119	link
2024-09-20	HUT: A More Computation Efficient Fine-Tuning Method With Hadamard Updated Transformation	Geyuan Zhang et.al.	2409.13501	null
2024-09-17	THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models	Mengfei Liang et.al.	2409.11353	link
2024-09-17	LPT++: Efficient Training on Mixture of Long-tailed Experts	Bowen Dong et.al.	2409.11323	null
2024-09-17	Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models	Divij Gupta et.al.	2409.11302	null
2024-09-18	Propulsion: Steering LLM with Tiny Fine-Tuning	Md Kowsher et.al.	2409.10927	link
2024-09-16	From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs	Navya Jain et.al.	2409.10245	null
2024-09-14	COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare	Chia-Hao Li et.al.	2409.09549	null
2024-09-14	Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models	Alireza Salemi et.al.	2409.09510	link
2024-09-13	Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights	Dixi Yao et.al.	2409.08482	null
2024-09-12	Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?	Kerem Cekmeceli et.al.	2409.07960	link
2024-09-11	Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region	Muhammad Akhtar Munir et.al.	2409.07585	link
2024-09-10	Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts	Assefa Seyoum Wahd et.al.	2409.06821	link
2024-09-11	Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models	Yao Shu et.al.	2409.06277	link
2024-09-09	SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values	Chengwei Sun et.al.	2409.05926	null
2024-09-10	Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment	Zhixian Zhao et.al.	2409.05015	null
2024-09-06	Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning	Xinyue Liu et.al.	2409.04574	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs	Ruoyu Wang et.al.	2409.02686	null
2024-09-04	Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA	Shuangyi Chen et.al.	2409.02346	null
2024-09-02	Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning	Chongjie Si et.al.	2409.01035	link
2024-08-28	3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability	Baohao Liao et.al.	2409.00119	link
2024-08-21	SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models	Yang Cao et.al.	2409.00055	link
2024-08-30	MoRe Fine-Tuning with 10x Fewer Parameters	Wenxuan Tan et.al.	2408.17383	link
2024-09-02	Instant Adversarial Purification with Adversarial Consistency Distillation	Chun Tong Lei et.al.	2408.17064	null
2024-08-28	Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization	Léo Hemamou et.al.	2408.15801	null
2024-08-27	GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs	Maxim Zhelnin et.al.	2408.15300	link
2024-08-27	Pre-training Everywhere: Parameter-Efficient Fine-Tuning for Medical Image Analysis via Target Parameter Pre-training	Xingliang Lei et.al.	2408.15011	null
2024-08-27	CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task	Lingyun Huang et.al.	2408.14961	link
2024-08-27	Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models	Aradhye Agarwal et.al.	2408.14470	link
2024-08-24	Advancing Enterprise Spatio-Temporal Forecasting Applications: Data Mining Meets Instruction Tuning of Language Models For Multi-modal Time Series Analysis in Low-Resource Settings	Sagar Srinivas Sakhinana et.al.	2408.13622	null
2024-08-21	Positional Prompt Tuning for Efficient 3D Representation Learning	Shaochen Zhang et.al.	2408.11567	link
2024-08-20	Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning	Bei Ouyang et.al.	2408.10746	null
2024-08-20	TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning	Bin Wang et.al.	2408.10688	link
2024-08-19	TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition	Tianwei Lin et.al.	2408.09856	link
2024-08-16	Learning to Route for Dynamic Adapter Composition in Continual Learning with Language Models	Vladimir Araujo et.al.	2408.09053	null
2024-08-14	KIND: Knowledge Integration and Diversion in Diffusion Models	Yucheng Xie et.al.	2408.07337	link
2024-08-30	TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning	Yujie Feng et.al.	2408.05200	link
2024-08-08	Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models	Yupeng Chang et.al.	2408.04556	link
2024-08-06	SARA: Singular-Value Based Adaptive Low-Rank Adaption	Jihao Gu et.al.	2408.03290	null
2024-08-06	Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi	Pranita Deshmukh et.al.	2408.03172	null
2024-08-03	TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks	Yang Yu et.al.	2408.01835	link
2024-08-02	MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts	Lin Ning et.al.	2408.01505	null
2024-08-02	Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs	Afia Anjum et.al.	2408.01008	null
2024-07-31	A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation	Mothilal Asokan et.al.	2407.21739	null
2024-07-28	Forecast-PEFT: Parameter-Efficient Fine-Tuning for Pre-trained Motion Forecasting Models	Jifeng Wang et.al.	2407.19564	link
2024-07-24	Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective	Jingren Liu et.al.	2407.17120	null
2024-07-22	Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders	Laura Niss et.al.	2407.15731	null
2024-07-21	Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization	Jiajun Hu et.al.	2407.15085	link
2024-07-16	InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification	Yujia Hu et.al.	2407.12882	link
2024-07-18	Turning Generative Models Degenerate: The Power of Data Poisoning Attacks	Shuli Jiang et.al.	2407.12281	null
2024-07-16	Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification	Naif Alkhunaizi et.al.	2407.11573	null
2024-07-16	An efficient framework based on large foundation model for cervical cytopathology whole slide image screening	Jialong Huang et.al.	2407.11486	link
2024-07-10	RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization	Xijie Huang et.al.	2407.08044	link
2024-07-10	ROSA: Random Subspace Adaptation for Efficient Fine-Tuning	Marawan Gamal Abdel Hameed et.al.	2407.07802	link
2024-07-10	Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction	Yumin Kim et.al.	2407.07517	null
2024-07-09	Reprogramming Distillation for Medical Foundation Models	Yuhang Zhou et.al.	2407.06504	link
2024-07-07	See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition	Chongjie Si et.al.	2407.05417	link
2024-07-16	LoRA-GA: Low-Rank Adaptation with Gradient Approximation	Shaowen Wang et.al.	2407.05000	link
2024-07-05	GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning	Aleksander Ficek et.al.	2407.04528	null
2024-07-04	Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models	Vorakit Vorakitphan et.al.	2407.04050	link
2024-07-04	ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution	Yuanbo Zhou et.al.	2407.03598	link
2024-07-03	Knowledge Composition using Task Vectors with Learned Anisotropic Scaling	Frederic Z. Zhang et.al.	2407.02880	link
2024-07-03	Exploring the Capabilities of LLMs for Code Change Related Tasks	Lishui Fan et.al.	2407.02824	link
2024-07-02	FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs	Haodong Chen et.al.	2407.02157	null
2024-07-02	CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications	Yupeng Cao et.al.	2407.01953	null
2024-07-05	Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models	Zihan Wang et.al.	2407.01906	link
2024-07-01	A Fingerprint for Large Language Models	Zhiguang Yang et.al.	2407.01235	null
2024-07-02	Embedded Prompt Tuning: Towards Enhanced Calibration of Pretrained Models for Medical Images	Wenqiang Zu et.al.	2407.01003	link
2024-06-25	Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning	Arijit Sehanobish et.al.	2406.17740	link
2024-06-19	Parameter Training Efficiency Aware Resource Allocation for AIGC in Space-Air-Ground Integrated Networks	Liangxin Qian et.al.	2406.13602	null
2024-06-19	Sparse High Rank Adapters	Kartikeya Bhardwaj et.al.	2406.13175	null
2024-06-18	Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates	Cristian Meo et.al.	2406.13046	null
2024-06-18	Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation	Branislav Pecher et.al.	2406.12471	link
2024-06-17	A Semantic-based Layer Freezing Approach to Efficient Fine-Tuning of Language Models	Jian Gu et.al.	2406.11753	null
2024-06-16	ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts	Samar Khanna et.al.	2406.10973	null
2024-06-16	ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation	Yurun Song et.al.	2406.10785	link
2024-06-16	RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning	Haoyu Wang et.al.	2406.10777	link
2024-06-15	Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models	Ruchao Fan et.al.	2406.10507	link
2024-06-15	Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts	Zhaoxuan Tan et.al.	2406.10471	link
2024-06-13	Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models	Lukas Thede et.al.	2406.09384	null
2024-06-12	Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods	Eugene Vyborov et.al.	2406.08582	null
2024-06-12	The Impact of Initialization on LoRA Finetuning Dynamics	Soufiane Hayou et.al.	2406.08447	null
2024-06-20	Low-Rank Quantization-Aware Training for LLMs	Yelysei Bondarenko et.al.	2406.06385	link
2024-06-10	A Parameter-efficient Language Extension Framework for Multilingual ASR	Wei Liu et.al.	2406.06329	null
2024-06-09	A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Automated Program Repair	Guochang Li et.al.	2406.05639	link
2024-06-07	Efficient Differentially Private Fine-Tuning of Diffusion Models	Jing Liu et.al.	2406.05257	null
2024-06-07	CorDA: Context-Oriented Decomposition Adaptation of Large Language Models	Yibo Yang et.al.	2406.05223	link
2024-06-07	An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models	Xiongtao Zhou et.al.	2406.05130	link
2024-06-07	MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter	Jitai Hao et.al.	2406.04984	link
2024-06-06	Time Sensitive Knowledge Editing through Efficient Finetuning	Xiou Ge et.al.	2406.04496	link
2024-06-06	VHDL-Eval: A Framework for Evaluating Large Language Models in VHDL Code Generation	Prashanth Vijayaraghavan et.al.	2406.04379	null
2024-06-10	Hypernetworks for Personalizing ASR to Atypical Speech	Max Müller-Eberstein et.al.	2406.04240	null
2024-06-06	Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning	Naibin Gu et.al.	2406.03792	link
2024-06-05	Choice of PEFT Technique in Continual Learning: Prompt Tuning is Not All You Need	Martin Wistuba et.al.	2406.03216	null
2024-06-06	Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision	Minglei Li et.al.	2406.03051	null
2024-05-31	Mamba State-Space Models Can Be Strong Downstream Learners	John T. Halloran et.al.	2406.00209	null
2024-05-30	ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections	Massimo Bini et.al.	2405.20271	link
2024-05-30	SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors	Vijay Lingam et.al.	2405.19597	link
2024-05-29	MemControl: Mitigating Memorization in Medical Diffusion Models via Automated Parameter Selection	Raman Dutt et.al.	2405.19458	link
2024-05-29	MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning	Junjie Wang et.al.	2405.18897	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	null
2024-06-01	Low-Rank Few-Shot Adaptation of Vision-Language Models	Maxime Zanella et.al.	2405.18541	null
2024-05-28	Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning	Renzhi Wang et.al.	2405.18292	null
2024-05-28	VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections	Roy Miles et.al.	2405.17991	link
2024-05-28	Sparsity- and Hybridity-Inspired Visual Parameter-Efficient Fine-Tuning for Medical Diagnosis	Mingyuan Liu et.al.	2405.17877	null
2024-05-27	LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters	Klaudia Bałazy et.al.	2405.17604	link
2024-05-23	EMR-Merging: Tuning-Free High-Performance Model Merging	Chenyu Huang et.al.	2405.17461	link
2024-05-28	DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution	Yulong Mao et.al.	2405.17357	link
2024-05-27	Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning	Runqian Wang et.al.	2405.17258	null
2024-05-30	Sparse Matrix in Large Language Model Fine-tuning	Haoze He et.al.	2405.15525	null
2024-05-24	Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt Adaptation	Abhinav Jain et.al.	2405.15282	link
2024-05-27	VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks	Yang Li et.al.	2405.15179	link
2024-05-23	Bitune: Bidirectional Instruction-Tuning	Dawid J. Kopiczko et.al.	2405.14862	null
2024-05-23	Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference	Ting Liu et.al.	2405.14700	link
2024-05-22	Spectral Adapter: Fine-Tuning in Spectral Space	Fangzhao Zhang et.al.	2405.13952	link
2024-05-24	MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models	Jingwei Xu et.al.	2405.13053	link
2024-05-20	FeTT: Continual Class Incremental Learning via Feature Transformation Tuning	Sunyuan Qiang et.al.	2405.11822	null
2024-05-21	HARIS: Human-Like Attention for Reference Image Segmentation	Mengxi Zhang et.al.	2405.10707	null
2024-05-28	DP-DyLoRA: Fine-Tuning Transformer-Based Models On-Device under Differentially Private Federated Learning using Dynamic Low-Rank Adaptation	Jie Xu et.al.	2405.06368	null
2024-05-09	Selective Fine-tuning on LLM-labeled Data May Reduce Reliance on Human Annotation: A Case Study Using Schedule-of-Event Table Detection	Bhawesh Kumar et.al.	2405.06093	null
2024-05-09	Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning	Shibo Jie et.al.	2405.05615	link
2024-05-07	Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning	Karim Galliamov et.al.	2405.04126	link
2024-05-04	Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning	Jing Xu et.al.	2405.02596	link
2024-03-16	Empirical Studies of Parameter Efficient Methods for Large Language Models of Code and Knowledge Transfer to R	Amirreza Esmaeili et.al.	2405.01553	link
2024-05-02	NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment	Gerald Shen et.al.	2405.01481	link
2024-04-29	LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report	Justin Zhao et.al.	2405.00732	link
2024-05-01	Investigating Automatic Scoring and Feedback using Large Language Models	Gloria Ashiya Katuka et.al.	2405.00602	null
2024-05-01	MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model	Rajat Sahay et.al.	2405.00293	null
2024-04-30	SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models	Samir Arora et.al.	2405.00201	null
2024-05-23	HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning	Chunlin Tian et.al.	2404.19245	link
2024-05-25	FeDeRA: Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition	Yuxuan Yan et.al.	2404.18848	null
2024-04-25	Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models	Jiawei Chen et.al.	2404.16385	null
2024-05-23	MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts	Dengchun Li et.al.	2404.15159	link
2024-04-22	ColA: Collaborative Adaptation with Gradient Learning	Enmao Diao et.al.	2404.13844	link
2024-04-23	Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications	Charith Chandra Sai Balne et.al.	2404.13506	null
2024-04-18	SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up	Nakyeong Yang et.al.	2404.11916	null
2024-04-16	Shears: Unstructured Sparsity with Neural Low-rank Adapter Search	J. Pablo Muñoz et.al.	2404.10934	link
2024-04-16	Exact and Efficient Unlearning for Large Language Model-based Recommendation	Zhiyu Hu et.al.	2404.10327	null
2024-04-15	LoRA Dropout as a Sparsity Regularizer for Overfitting Control	Yang Lin et.al.	2404.09610	null
2024-04-21	Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in LLMs	Ahmed Agiza et.al.	2404.08699	link
2024-04-08	Certified PEFTSmoothing: Parameter-Efficient Fine-Tuning with Randomized Smoothing	Chengyan Fu et.al.	2404.05350	null
2024-04-08	DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model	Chao Gao et.al.	2404.05182	null
2024-04-12	Q-PEFT: Query-dependent Parameter Efficient Fine-tuning for Text Reranking with Large Language Models	Zhiyuan Peng et.al.	2404.04522	null
2024-04-05	Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation	Tong Su et.al.	2404.04212	null
2024-05-22	ReFT: Representation Finetuning for Language Models	Zhengxuan Wu et.al.	2404.03592	link
2024-06-11	Personalized LLM Response Generation with Parameterized Memory Injection	Kai Zhang et.al.	2404.03565	link
2024-06-20	Eigenpruning: an Interpretability-Inspired PEFT Method	Tomás Vergara-Browne et.al.	2404.03147	link
2024-05-28	PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models	Fanxu Meng et.al.	2404.02948	link
2024-04-03	Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data	Parth Patwa et.al.	2404.02422	null
2024-04-11	IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT	Junchen Fu et.al.	2404.02059	link
2024-03-31	Query-driven Relevant Paragraph Extraction from Legal Judgments	T. Y. S. S Santosh et.al.	2404.00595	null
2024-03-30	Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4	Aryo Pradipta Gema et.al.	2404.00484	link
2024-04-03	InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning	Yan-Shuo Liang et.al.	2404.00228	link
2024-03-27	Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation	Mateusz Klimaszewski et.al.	2403.18804	link
2024-03-26	The Unreasonable Ineffectiveness of the Deeper Layers	Andrey Gromov et.al.	2403.17887	null
2024-04-15	ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models	Zequan Liu et.al.	2403.16187	null
2024-03-22	KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation	Xindi Luo et.al.	2403.14950	link
2024-03-22	A Single Linear Layer Yields Task-Adapted Low-Rank Matrices	Hwichan Kim et.al.	2403.14946	null
2024-03-21	AutoRE: Document-Level Relation Extraction with Large Language Models	Xue Lilong et.al.	2403.14888	link
2024-04-29	Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey	Zeyu Han et.al.	2403.14608	null
2024-03-20	Harnessing Large Language Models for Text-Rich Sequential Recommendation	Zhi Zheng et.al.	2403.13325	link
2024-04-16	AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models	Zeyu Liu et.al.	2403.13269	null
2024-03-18	Improving LoRA in Privacy-preserving Federated Learning	Youbang Sun et.al.	2403.12313	null
2024-03-18	Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation	Wangbo Zhao et.al.	2403.11808	link
2024-03-18	Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model	Haoyun Xu et.al.	2403.11621	null
2024-03-19	JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning	Anique Tahir et.al.	2403.11366	link
2024-03-14	Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks	Tingyu Qu et.al.	2403.09377	link
2024-03-14	PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation	Yizhe Xiong et.al.	2403.09192	link
2024-03-13	Data-oriented Dynamic Fine-tuning Parameter Selection Strategy for FISH Mask based Efficient Fine-tuning	Ming Dong et.al.	2403.08484	null

(back to top)

Text-to-Image Generation

Publish Date	Title	Authors	PDF	Code
2025-05-27	Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers	Wei Pang et.al.	2505.21497	null
2025-05-27	Be Decisive: Noise-Induced Layouts for Multi-Subject Generation	Omer Dahary et.al.	2505.21488	null
2025-05-27	PropMolFlow: Property-guided Molecule Generation with Geometry-Complete Flow Matching	Cheng Zeng et.al.	2505.21469	null
2025-05-27	Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion	Zhanqiu Hu et.al.	2505.21467	null
2025-05-27	Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling	Xiangxin Zhou et.al.	2505.21452	null
2025-05-27	CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects	Huaijin Pi et.al.	2505.21437	null
2025-05-27	Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks	Francesco Cozzi et.al.	2505.21426	null
2025-05-27	GUARD: Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation	Naizhu Jin et.al.	2505.21425	null
2025-05-27	A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment	Brett Bissey et.al.	2505.21414	null
2025-05-27	A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective	Gen Li et.al.	2505.21400	null
2025-05-28	OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models	Ziheng Cheng et.al.	2505.21347	link
2025-05-28	MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on	Guangyuan Li et.al.	2505.21325	null
2025-05-27	Evaluation of LLMs in Medical Text Summarization: The Role of Vocabulary Adaptation in High OOV Settings	Gunjan Balde et.al.	2505.21242	null
2025-05-28	Custom Representations of Inductive Families	Constantine Theocharis et.al.	2505.21225	null
2025-05-27	Simulations of the churning mode: toroidally symmetric plasma convection and turbulence around the X-points in a snowflake divertor	D Power et.al.	2505.21223	null
2025-05-26	Multimodal Federated Learning With Missing Modalities through Feature Imputation Network	Pranav Poudel et.al.	2505.20232	null
2025-05-26	Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence	Edem Ahadzi et.al.	2505.20216	null
2025-05-26	Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking	Pengxiang Li et.al.	2505.20199	null
2025-05-26	Private Geometric Median in Nearly-Linear Time	Syamantak Kumar et.al.	2505.20189	null
2025-05-26	Exposing Go's Hidden Bugs: A Novel Concolic Framework	Karolina Gorna et.al.	2505.20183	null
2025-05-26	Long-Context State-Space Video World Models	Ryan Po et.al.	2505.20171	null
2025-05-26	MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning	Yuanxin Zhuang et.al.	2505.20131	null
2025-05-26	Understanding Generalization in Diffusion Models via Probability Flow Distance	Huijie Zhang et.al.	2505.20123	null
2025-05-26	Proxy-Free GFlowNet	Ruishuo Chen et.al.	2505.20110	null
2025-05-26	Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning	Ziyi Zhang et.al.	2505.20107	null
2025-05-26	Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models	Makesh Narsimhan Sreedhar et.al.	2505.20087	null
2025-05-26	PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation	Hongsong Wang et.al.	2505.20056	null
2025-05-26	Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	Zheqi Lv et.al.	2505.20053	null
2025-05-26	The Many Challenges of Human-Like Agents in Virtual Game Environments	Maciej Świechowski et.al.	2505.20011	null
2025-05-26	ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications	Tong Wu et.al.	2505.19983	null
2025-05-26	Rethinking Probabilistic Circuit Parameter Learning	Anji Liu et.al.	2505.19982	null
2025-05-26	UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space	Yong Liu et.al.	2505.19958	null
2025-05-26	Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement	Afrah Shaahid et.al.	2505.19895	null
2025-05-26	A fully automated urban PV parameterization framework for improved estimation of energy production profiles	Bowen Tian et.al.	2505.19876	null
2025-05-26	StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation	Yi Wu et.al.	2505.19874	null
2025-05-26	Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling	Junhong Lee et.al.	2505.19868	null
2025-05-23	Generative Distribution Embeddings	Nic Fishman et.al.	2505.18150	null
2025-05-23	Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading	Mohamed Swailem et.al.	2505.18145	null
2025-05-26	TokBench: Evaluating Your Visual Tokenizer before Visual Generation	Junfeng Wu et.al.	2505.18142	null
2025-05-23	One RL to See Them All: Visual Triple Unified Reinforcement Learning	Yan Ma et.al.	2505.18129	null
2025-05-23	Towards more transferable adversarial attack in black-box manner	Chun Tong Lei et.al.	2505.18097	null
2025-05-23	DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations	Ziqiao Peng et.al.	2505.18096	null
2025-05-23	SpikeGen: Generative Framework for Visual Spike Stream Processing	Gaole Dai et.al.	2505.18049	null
2025-05-23	RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2505.18047	null
2025-05-26	Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling	Matthieu Blanke et.al.	2505.18017	null
2025-05-23	Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation	Zhihua Liu et.al.	2505.17994	null
2025-05-23	To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models	Simone Gaisbauer et.al.	2505.17973	null
2025-05-23	Diffusion Classifiers Understand Compositionality, but Conditions Apply	Yujin Jeong et.al.	2505.17955	null
2025-05-23	SplatCo: Structure-View Collaborative Gaussian Splatting for Detail-Preserving Rendering of Large-Scale Unbounded Scenes	Haihong Xiao et.al.	2505.17951	null
2025-05-23	Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity	Zhihong Chen et.al.	2505.17937	null
2025-05-23	Flexible MOF Generation with Torsion-Aware Flow Matching	Nayoung Kim et.al.	2505.17914	null
2025-05-22	GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning	Chengqi Duan et.al.	2505.17022	link
2025-05-22	When Are Concepts Erased From Diffusion Models?	Kevin Lu et.al.	2505.17013	null
2025-05-22	Guided Diffusion Sampling on Function Spaces with Applications to PDEs	Jiachen Yao et.al.	2505.17004	link
2025-05-22	Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction	Dong Li et.al.	2505.16980	null
2025-05-22	Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On	Siqi Wan et.al.	2505.16977	link
2025-05-22	Creatively Upscaling Images with Global-Regional Priors	Yurui Qian et.al.	2505.16976	null
2025-05-22	Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models	Alessandro Favero et.al.	2505.16959	null
2025-05-22	From Reality to Virtual Worlds: The Role of Photogrammetry in Game Development	Santiago Berrezueta-Guzman et.al.	2505.16951	null
2025-05-22	LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning	Zebin You et.al.	2505.16933	null
2025-05-22	Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks	Hongyuan Tao et.al.	2505.16901	null
2025-05-22	T2I-ConBench: Text-to-Image Benchmark for Continual Post-training	Zhehao Huang et.al.	2505.16875	null
2025-05-22	Training-Free Efficient Video Generation via Dynamic Token Carving	Yuechen Zhang et.al.	2505.16864	link
2025-05-22	Conditional Panoramic Image Generation via Masked Autoregressive Modeling	Chaoyang Wang et.al.	2505.16862	null
2025-05-23	LaViDa: A Large Diffusion Language Model for Multimodal Understanding	Shufan Li et.al.	2505.16839	link
2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link
2025-05-21	Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization	Satoshi Kosugi et.al.	2505.15812	link
2025-05-21	On the creation of narrow AI: hierarchy and nonlocality of neural network skills	Eric J. Michaud et.al.	2505.15811	link
2025-05-21	Neural Conditional Transport Maps	Carlos Rodriguez-Pardo et.al.	2505.15808	null
2025-05-21	Interspatial Attention for Efficient 4D Human Video Generation	Ruizhi Shao et.al.	2505.15800	null
2025-05-21	VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL	Fengyuan Dai et.al.	2505.15791	null
2025-05-21	Exploring the Innovation Opportunities for Pre-trained Models	Minjung Park et.al.	2505.15790	null
2025-05-21	IA-T2I: Internet-Augmented Text-to-Image Generation	Chuanhao Li et.al.	2505.15779	null
2025-05-21	Constructing a 3D Town from a Single Image	Kaizhi Zheng et.al.	2505.15765	null
2025-05-21	HybridProver: Augmenting Theorem Proving with LLM-Driven Proof Synthesis and Refinement	Jilin Hu et.al.	2505.15740	null
2025-05-21	Distributionally Robust Planning of Hydrogen-Electrical Microgrids for Sea Islands	Yuchen Dong et.al.	2505.15733	null
2025-05-21	Can Large Language Models be Effective Online Opinion Miners?	Ryang Heo et.al.	2505.15695	null
2025-05-21	SwarmDiff: Swarm Robotic Trajectory Planning in Cluttered Environments via Diffusion Transformer	Kang Ding et.al.	2505.15679	null
2025-05-21	Graph Conditional Flow Matching for Relational Data Generation	Davide Scassola et.al.	2505.15668	link
2025-05-21	FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models	Zhen Sun et.al.	2505.15644	link
2025-05-21	Trial and Return Option Strategy in Omnichannel Retailing	Yasuyuki Kusuda et.al.	2505.15597	null
2025-05-20	NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search	Sunhao Dai et.al.	2505.14680	null
2025-05-20	Training-Free Watermarking for Autoregressive Image Generation	Yu Tong et.al.	2505.14673	null
2025-05-21	General-Reasoner: Advancing LLM Reasoning Across All Domains	Xueguang Ma et.al.	2505.14652	null
2025-05-20	Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs	Morgan Lindsay Heisler et.al.	2505.14620	null
2025-05-20	Towards a Foundation Model for Communication Systems	Davide Buffelli et.al.	2505.14603	null
2025-05-20	Neural Inverse Scattering with Score-based Regularization	Yuan Gao et.al.	2505.14560	null
2025-05-20	Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI	Marlène Careil et.al.	2505.14556	link
2025-05-20	GUARD: Constructing Realistic Two-Player Matrix and Security Games for Benchmarking Game-Theoretic Algorithms	Noah Krever et.al.	2505.14547	null
2025-05-20	NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation	Matteo El-Hariry et.al.	2505.14526	null
2025-05-21	Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling	Zhihao Li et.al.	2505.14521	null
2025-05-20	Learning to Integrate Diffusion ODEs by Averaging the Derivatives	Wenze Liu et.al.	2505.14502	null
2025-05-20	A Direct Comparison of Simultaneously Recorded Scalp, Around-Ear, and In-Ear EEG for Neural Selective Auditory Attention Decoding to Speech	Simon Geirnaert et.al.	2505.14478	null
2025-05-20	Enhancing Interpretability of Sparse Latent Representations with Class Information	Farshad Sangari Abiz et.al.	2505.14476	null
2025-05-20	CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation	Chihan Huang et.al.	2505.14455	null
2025-05-20	Compositional amortized inference for large-scale hierarchical Bayesian models	Jonas Arruda et.al.	2505.14429	null
2025-05-19	Mean Flows for One-step Generative Modeling	Zhengyang Geng et.al.	2505.13447	null
2025-05-19	Synthetic-Powered Predictive Inference	Meshi Bashari et.al.	2505.13432	link
2025-05-20	A Practical Guide for Incorporating Symmetry in Diffusion Policy	Dian Wang et.al.	2505.13431	null
2025-05-19	Faster Video Diffusion with Trainable Sparse Attention	Peiyuan Zhang et.al.	2505.13389	null
2025-05-19	Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation	Yasi Zhang et.al.	2505.13377	null
2025-05-20	Minimum-Excess-Work Guidance	Christopher Kolloff et.al.	2505.13375	null
2025-05-20	One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling	Nimrod Berman et.al.	2505.13358	link
2025-05-19	Frequency-Dependent Power Consumption Modeling of CMOS Transmitters for WNoC Architectures	Mohammad Shahmoradi et.al.	2505.13310	null
2025-05-19	FlowPure: Continuous Normalizing Flows for Adversarial Purification	Elias Collaert et.al.	2505.13280	link
2025-05-19	Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models	Lucas Berry et.al.	2505.13273	null
2025-05-19	Distilling a speech and music encoder with task arithmetic	Fabian Ritter-Gutierrez et.al.	2505.13270	null
2025-05-19	Correlation between U/Th and Pb/Os abundance ratios and its application in nuclear cosmochronology	Y. Y. Huang et.al.	2505.13269	null
2025-05-19	JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models	Jieying Xue et.al.	2505.13244	link
2025-05-19	Conformalized Decision Risk Assessment	Wenbin Zhou et.al.	2505.13243	null
2025-05-19	Diffusion Models with Double Guidance: Generate with aggregated datasets	Yanfeng Yang et.al.	2505.13213	null
2025-05-16	Evolution of granular salty ice analogs for Europa: Sublimation and Irradiation	Rafael Ottersberg et.al.	2505.11498	null
2025-05-16	QVGen: Pushing the Limit of Quantized Video Generative Models	Yushi Huang et.al.	2505.11497	null
2025-05-16	Unsupervised Detection of Distribution Shift in Inverse Problems using Diffusion Models	Shirin Shoushtari et.al.	2505.11482	null
2025-05-16	PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment	Dingbang Huang et.al.	2505.11468	null
2025-05-16	Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views	Abhishek Kashyap et.al.	2505.11467	null
2025-05-16	Disentangling Reasoning and Knowledge in Medical Large Language Models	Rahul Thapa et.al.	2505.11462	null
2025-05-16	A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation	Xinran Song et.al.	2505.11444	null
2025-05-19	MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production	Chao Jin et.al.	2505.11432	null
2025-05-16	Diff-Unfolding: A Model-Based Score Learning Framework for Inverse Problems	Yuanhao Wang et.al.	2505.11393	null
2025-05-16	LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models	Danilo de Oliveira et.al.	2505.11391	null
2025-05-16	MARRS: Masked Autoregressive Unit-based Reaction Synthesis	Y. B. Wang et.al.	2505.11334	null
2025-05-16	Decomposing stimulus-specific sensory neural information via diffusion models	Steeve Laquitaine et.al.	2505.11309	null
2025-05-16	Effective Probabilistic Time Series Forecasting with Fourier Adaptive Noise-Separated Diffusion	Xinyan Wang et.al.	2505.11306	null
2025-05-16	A Fourier Space Perspective on Diffusion Models	Fabian Falck et.al.	2505.11278	null
2025-05-16	DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models	Giulia Bertazzini et.al.	2505.11257	null
2025-05-15	3D-Fixup: Advancing Photo Editing with 3D Priors	Yen-Chi Cheng et.al.	2505.10566	null
2025-05-15	T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback	Zehan Wang et.al.	2505.10561	null
2025-05-15	Style Customization of Text-to-Vector Generation with Image Diffusion Priors	Peiying Zhang et.al.	2505.10558	null
2025-05-15	Flowing Through Hilbert Space: Quantum-Enhanced Generative Models for Lattice Field Theory	Jehu Martinez et.al.	2505.10553	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design	Amira Alakhdar et.al.	2505.10545	null
2025-05-15	LibIQ: Toward Real-Time Spectrum Classification in O-RAN dApps	Filippo Olimpieri et.al.	2505.10537	null
2025-05-15	Optimal Pricing With Impatient Customers	Jieqi Di et.al.	2505.10514	null
2025-05-15	CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs	Raman Dutt et.al.	2505.10496	link
2025-05-15	Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns	Leon Hannig et.al.	2505.10490	null
2025-05-15	UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation	Yi Li et.al.	2505.10483	null
2025-05-15	Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps	Ningyuan Yang et.al.	2505.10482	null
2025-05-15	AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges	Ranjan Sapkota et.al.	2505.10468	null
2025-05-15	Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	Zemin Huang et.al.	2505.10446	null
2025-05-15	Score-based diffusion nowcasting of GOES imagery	Randy J. Chase et.al.	2505.10432	null
2025-05-14	Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors	Nicolas Dupuis et.al.	2505.09610	null
2025-05-14	LightLab: Controlling Light Sources in Images with Diffusion Models	Nadav Magar et.al.	2505.09608	null
2025-05-14	Don't Forget your Inverse DDIM for Image Editing	Guillermo Gomez-Trenado et.al.	2505.09571	null
2025-05-14	BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset	Jiuhai Chen et.al.	2505.09568	link
2025-05-14	CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios	Raghav Garg et.al.	2505.09436	link
2025-05-14	Efficient Modelling of Lyman-α opacity fluctuations during late EoR	Barun Maity et.al.	2505.09369	null
2025-05-14	Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch	Michael Benigni et.al.	2505.09364	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-14	APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression	Srinivas Ravuri et.al.	2505.09356	link
2025-05-14	Access Controls Will Solve the Dual-Use Dilemma	Evžen Wybitul et.al.	2505.09341	null
2025-05-14	DCSNet: A Lightweight Knowledge Distillation-Based Model with Explainable AI for Lung Cancer Diagnosis from Histopathological Images	Sadman Sakib Alif et.al.	2505.09334	null
2025-05-14	TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving	Xuefeng Jiang et.al.	2505.09315	null
2025-05-14	Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations	Panqi Chen et.al.	2505.09284	null
2025-05-14	A Note on Semantic Diffusion	Alexander P. Ryjov et.al.	2505.09283	null
2025-05-14	Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation	Guan Gui et.al.	2505.09263	link
2025-05-13	PCS-UQ: Uncertainty Quantification via the Predictability-Computability-Stability Framework	Abhineet Agarwal et.al.	2505.08784	null
2025-05-13	Generative Molecular Design with Steerable and Granular Synthesizability Control	Jeff Guo et.al.	2505.08774	link
2025-05-13	Controllable Image Colorization with Instance-aware Texts and Masks	Yanru An et.al.	2505.08705	null
2025-05-13	A Survey of Deep Learning for Complex Speech Spectrograms	Yuying Xie et.al.	2505.08694	null
2025-05-13	A Machine Learning Pipeline for Molecular Property Prediction using ChemXploreML	Aravindh Nivas Marimuthu et.al.	2505.08688	null
2025-05-13	Comparison of laser system designs for quantum technologies: BECCAL flight system vs. BECCAL ground test bed	Victoria A. Henderson et.al.	2505.08680	null
2025-05-13	A Study of Data-driven Methods for Inventory Optimization	Lee Yeung Ping et.al.	2505.08673	null
2025-05-13	WixQA: A Multi-Dataset Benchmark for Enterprise Retrieval-Augmented Generation	Dvir Cohen et.al.	2505.08643	null
2025-05-13	Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models	Donghoon Kim et.al.	2505.08622	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-13	Extract the Best, Discard the Rest: CSI Feedback with Offline Large AI Models	Jialin Zhuang et.al.	2505.08566	null
2025-05-13	DFA-CON: A Contrastive Learning Approach for Detecting Copyright Infringement in DeepFake Art	Haroon Wahab et.al.	2505.08552	null
2025-05-13	Diffusion-assisted Model Predictive Control Optimization for Power System Real-Time Operation	Linna Xu et.al.	2505.08535	null
2025-05-13	Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks	Chenru Duan et.al.	2505.08531	link
2025-05-14	Improving Data Fidelity via Diffusion Model-based Correction and Super-Resolution	Wuzhe Xu et.al.	2505.08526	null
2025-05-12	H$^{\mathbf{3}}$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning	Yiyang Lu et.al.	2505.07819	null
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Pixel Motion as Universal Representation for Robot Control	Kanchana Ranasinghe et.al.	2505.07817	null
2025-05-12	Continuous Visual Autoregressive Generation via Score Maximization	Chenze Shao et.al.	2505.07812	link
2025-05-12	Improving Trajectory Stitching with Flow Models	Reece O'Mahoney et.al.	2505.07802	null
2025-05-12	Learning Dynamics in Continual Pre-Training for Large Language Models	Xingjin Wang et.al.	2505.07796	null
2025-05-12	Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation	Arya Grayeli et.al.	2505.07777	null
2025-05-12	LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention	Jiangling Zhang et.al.	2505.07734	null
2025-05-12	ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models	Ozgur Kara et.al.	2505.07652	null
2025-05-12	Markov Modelling Approach for Queues with Correlated Service Times -- the $M/M_D/2$ Model	Suman Thapa et.al.	2505.07648	null
2025-05-12	Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models	Riccardo Passoni et.al.	2505.07615	null
2025-05-12	SecReEvalBench: A Multi-turned Security Resilience Evaluation Benchmark for Large Language Models	Huining Cui et.al.	2505.07584	null
2025-05-12	Noise Optimized Conditional Diffusion for Domain Adaptation	Lingkun Luo et.al.	2505.07548	null
2025-05-12	RAI: Flexible Agent Framework for Embodied AI	Kajetan Rachwał et.al.	2505.07532	link
2025-05-13	FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images	Raul Ismayilov et.al.	2505.07530	link
2025-05-09	Long time behaviour of Mean Field Games with fractional diffusion	Olav Ersland et.al.	2505.06183	null
2025-05-09	DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models	Radu Alexandru Rosu et.al.	2505.06166	null
2025-05-09	Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study	Faeze Ghorbanpour et.al.	2505.06149	null
2025-05-09	Constraints to Lorentz violation and ultrahigh-energy electrons in D-foamy space-times	Chengyi Li et.al.	2505.06121	null
2025-05-09	Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation	Dongying Li et.al.	2505.06117	null
2025-05-09	FIC-TSC: Learning Time Series Classification with Fisher Information Constraint	Xiwen Chen et.al.	2505.06114	null
2025-05-09	Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation	Kunpeng Qiu et.al.	2505.06068	link
2025-05-09	Droplet Outbursts from Onion Cutting	Zixuan Wu et.al.	2505.06016	null
2025-05-09	Offline Multi-agent Reinforcement Learning via Score Decomposition	Dan Qiao et.al.	2505.05968	null
2025-05-09	GEORCE: A Fast New Control Algorithm for Computing Geodesics	Frederik Möbius Rygaard et.al.	2505.05961	link
2025-05-09	Summarisation of German Judgments in conjunction with a Class-based Evaluation	Bianca Steffes et.al.	2505.05947	link
2025-05-09	Autoencoder-Based Hybrid Replay for Class-Incremental Learning	Milad Khademi Nori et.al.	2505.05926	null
2025-05-09	A 3D pocket-aware and evolutionary conserved interaction guided diffusion model for molecular optimization	Anjie Qiao et.al.	2505.05874	null
2025-05-09	Screening Mechanisms on White Dwarfs: Symmetron & Dilaton	Joan Bachs-Esteban et.al.	2505.05871	null
2025-05-09	Generative Discovery of Partial Differential Equations by Learning from Math Handbooks	Hao Xu et.al.	2505.05869	null
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	3D Scene Generation: A Survey	Beichen Wen et.al.	2505.05474	link
2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472	null
2025-05-08	Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting	Kazi Ashik Islam et.al.	2505.05381	null
2025-05-08	SDR-RDMA: Software-Defined Reliability Architecture for Planetary Scale RDMA Communication	Mikhail Khalilov et.al.	2505.05366	null
2025-05-08	Modelling and Verifying Neuronal Archetypes in Coq	Abdorrahim Bahrami et.al.	2505.05362	link
2025-05-08	SmartTrap: Automated Precision Experiments with Optical Tweezers	Martin Selin et.al.	2505.05290	null
2025-05-08	Diffusion Model Quantization: A Review	Qian Zeng et.al.	2505.05215	link
2025-05-08	EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution	Haizhen Xie et.al.	2505.05209	null
2025-05-08	Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt	Joel Z. Leibo et.al.	2505.05197	null
2025-05-08	Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning	Chuangtao Chen et.al.	2505.05151	link
2025-05-08	Research on Anomaly Detection Methods Based on Diffusion Models	Yi Chen et.al.	2505.05137	null
2025-05-08	Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach	Xuyang Chen et.al.	2505.05126	null
2025-05-08	MDAA-Diff: CT-Guided Multi-Dose Adaptive Attention Diffusion Model for PET Denoising	Xiaolong Niu et.al.	2505.05112	null
2025-05-07	Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond	Jessie Richter-Powell et.al.	2505.04621	null
2025-05-08	Flexing RISC-V Instruction Subset Processors (RISPs) to Extreme Edge	Alireza Raisiardali et.al.	2505.04567	null
2025-05-07	Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions	Shanyu Han et.al.	2505.04553	null
2025-05-07	Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model	Pengfei Guo et.al.	2505.04522	null
2025-05-08	HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation	Teng Hu et.al.	2505.04512	null
2025-05-07	Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts	Ilya Koziev et.al.	2505.04507	null
2025-05-07	Uncovering Key Features for Model-Driven Engineering of Complex Performance Indicators: A Scoping Review	Benito Giunta et.al.	2505.04498	null
2025-05-08	Defining and Quantifying Creative Behavior in Popular Image Generators	Aditi Ramaswamy et.al.	2505.04497	null
2025-05-07	Efficient Flow Matching using Latent Variables	Anirban Samaddar et.al.	2505.04486	null
2025-05-08	FA-KPConv: Introducing Euclidean Symmetries to KPConv via Frame Averaging	Ali Alawieh et.al.	2505.04485	null
2025-05-07	Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration	Shigeki Karita et.al.	2505.04457	link
2025-05-07	An Asynchronous Distributed-Memory Parallel Algorithm for k-mer Counting	Souvadra Hati et.al.	2505.04431	link
2025-05-07	Recognizing Ornaments in Vocal Indian Art Music with Active Annotation	Sumit Kumar et.al.	2505.04419	null
2025-05-07	Localized Diffusion Models for High Dimensional Distributions Generation	Georg A. Gottwald et.al.	2505.04417	null
2025-05-07	The Aloe Family Recipe for Open and Specialized Healthcare LLMs	Dario Garcia-Gasulla et.al.	2505.04388	null
2025-05-06	FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios	Shiyi Zhang et.al.	2505.03730	null
2025-05-06	Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid	Parv Kapoor et.al.	2505.03694	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-06	Distribution-Conditional Generation: From Class Distribution to Creative Generation	Fu Feng et.al.	2505.03667	null
2025-05-06	Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map	Alessandro Simoni et.al.	2505.03623	link
2025-05-07	PAHA: Parts-Aware Audio-Driven Human Animation with Diffusion Model	Y. B. Wang et.al.	2505.03603	null
2025-05-06	From Pixels to Polygons: A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction	Fengming Lin et.al.	2505.03599	null
2025-05-06	Real-Time Person Image Synthesis Using a Flow Matching Model	Jiwoo Jeong et.al.	2505.03562	null
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-06	Efficient Training of Physics-enhanced Neural ODEs via Direct Collocation and Nonlinear Programming	Linus Langenkamp et.al.	2505.03552	null
2025-05-06	Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability	Dip Roy et.al.	2505.03530	null
2025-05-06	Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking	Shenglan Li et.al.	2505.03507	link
2025-05-06	A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM)	Faiz Taleb et.al.	2505.03490	null
2025-05-06	Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients	Stefano Bruno et.al.	2505.03432	null
2025-05-06	Phenotype-Guided Generative Model for High-Fidelity Cardiac MRI Synthesis: Advancing Pretraining and Clinical Applications	Ziyu Li et.al.	2505.03426	null
2025-05-05	Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models	Kuofeng Gao et.al.	2505.02824	link
2025-05-05	MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing	Zinan Guo et.al.	2505.02823	link
2025-05-05	Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models	Yankai Jiang et.al.	2505.02753	link
2025-05-05	The use of Artificial Intelligence for Intervention and Assessment in Individuals with ASD	Aggeliki Sideraki et.al.	2505.02747	null
2025-05-05	Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	Yemin Shi et.al.	2505.02707	link
2025-05-05	Hierarchical random measures without tables	Marta Catalano et.al.	2505.02653	null
2025-05-06	MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation	Mingcheng Li et.al.	2505.02648	null
2025-05-05	Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era	Chenxi Liu et.al.	2505.02583	link
2025-05-05	Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities	Xinjie Zhang et.al.	2505.02567	link
2025-05-05	Bielik v3 Small: Technical Report	Krzysztof Ociepa et.al.	2505.02550	null
2025-05-06	Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces	Yang Lyu et.al.	2505.02508	null
2025-05-05	Hypothesis testing and Stein's lemma in general probability theories with Euclidean Jordan algebra and its quantum realization	Kanta Sonoda et.al.	2505.02487	null
2025-05-05	Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction	Biao Gong et.al.	2505.02471	link
2025-05-05	Data Augmentation With Back translation for Low Resource languages: A case of English and Luganda	Richard Kimera et.al.	2505.02463	null
2025-05-05	Predicting the Dynamics of Complex System via Multiscale Diffusion Autoencoder	Ruikun Li et.al.	2505.02450	null
2025-05-02	GENMO: A GENeralist Model for Human MOtion	Jiefeng Li et.al.	2505.01425	null
2025-05-02	Computational, Data-Driven, and Physics-Informed Machine Learning Approaches for Microstructure Modeling in Metal Additive Manufacturing	D. Patel et.al.	2505.01424	null
2025-05-02	VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models	Mohammadreza Teymoorianfard et.al.	2505.01406	link
2025-05-02	Provable Efficiency of Guidance in Diffusion Models for General Data Distribution	Gen Li et.al.	2505.01382	null
2025-05-02	Binamix -- A Python Library for Generating Binaural Audio Datasets	Dan Barry et.al.	2505.01369	link
2025-05-02	FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors	Chenxi Li et.al.	2505.01322	null
2025-05-02	Model See Model Do: Speech-Driven Facial Animation with Style Control	Yifang Pan et.al.	2505.01319	null
2025-05-02	ViSA-Flow: Accelerating Robot Skill Learning via Large-Scale Video Semantic Action Flow	Changhe Chen et.al.	2505.01288	null
2025-05-02	Scoring-Assisted Generative Exploration for Proteins (SAGE-Prot): A Framework for Multi-Objective Protein Optimization via Iterative Sequence Generation and Evaluation	Hocheol Lim et.al.	2505.01277	link
2025-05-02	Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications	Elie Saad et.al.	2505.01261	null
2025-05-05	Enabling Training-Free Semantic Communication Systems with Generative Diffusion Models	Shunpu Tang et.al.	2505.01209	null
2025-05-02	A Secured Triad of IoT, Machine Learning, and Blockchain for Crop Forecasting in Agriculture	Najmus Sakib Sizan et.al.	2505.01196	null
2025-05-02	A Combinatorial Proof of Universal Optimality for Computing a Planar Convex Hull	Ivor van der Hoog et.al.	2505.01194	null
2025-05-02	FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis	Jiangtong Tan et.al.	2505.01172	link
2025-05-02	Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications	Jiawei He et.al.	2505.01146	null
2025-05-01	Controllable Weather Synthesis and Removal with Video Diffusion Models	Chih-Hao Lin et.al.	2505.00704	null
2025-05-01	T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT	Dongzhi Jiang et.al.	2505.00703	link
2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null
2025-05-01	Visual Trajectory Prediction of Vessels for Inland Navigation	Alexander Puzicha et.al.	2505.00599	null
2025-05-01	ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models	Jiarong Wei et.al.	2505.00586	null
2025-05-01	Safety-Critical Traffic Simulation with Guided Latent Diffusion Model	Mingxing Peng et.al.	2505.00515	null
2025-05-01	A General Model for Linearly Polarized Optical Vector Beams	Jonathan Nichols et.al.	2505.00471	null
2025-05-01	A Neural Network Mode for PX4 on Embedded Flight Controllers	Sindre M. Hegre et.al.	2505.00432	link
2025-05-01	Over-the-Air Inference over Multi-hop MIMO Networks	Chenghong Bian et.al.	2505.00430	null
2025-05-01	Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly	Ruiyuan Zhang et.al.	2505.00426	null
2025-05-01	CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass	Bowen Zhang et.al.	2505.00389	link
2025-05-01	Towards Lightweight Hyperspectral Image Super-Resolution with Depthwise Separable Dilated Convolutional Network	Usman Muhammad et.al.	2505.00374	link
2025-05-01	Denoising weak lensing mass maps with diffusion model: systematic comparison with generative adversarial network	Shohei D. Aoyama et.al.	2505.00345	null
2025-05-01	T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation	Xuyang Guo et.al.	2505.00337	null
2025-05-01	Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution	Luigi Sigillo et.al.	2505.00334	null
2025-04-30	ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction	Qihao Liu et.al.	2504.21855	null
2025-04-30	3D Stylization via Large Reconstruction Model	Ipek Oztas et.al.	2504.21836	null
2025-04-30	From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems	Huan Zhang et.al.	2504.21815	null
2025-04-30	Anatomical Similarity as a New Metric to Evaluate Brain Generative Models	Bahram Jafrasteh et.al.	2504.21771	null
2025-04-30	MovementVR: An open-source tool for the study of motor control and learning in virtual reality	Cristina Rossi et.al.	2504.21696	null
2025-04-30	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-04-30	Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection	Liqin Wang et.al.	2504.21646	null
2025-04-30	ODE and PDE models for COVID-19, with reinfection and vaccination process for Cameroon and Germany	Hamadjam Abboubakar et.al.	2504.21613	null
2025-04-30	Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication	Zehao Chen et.al.	2504.21577	null
2025-04-30	Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation	Bikash Saha et.al.	2504.21574	null
2025-04-30	FreeBeacon: Efficient Communication and Data Aggregation in Battery-Free IoT	Gaosheng Liu et.al.	2504.21571	null
2025-04-30	MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance	Mengting Wei et.al.	2504.21497	link
2025-04-30	DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration	Hebaixu Wang et.al.	2504.21487	link
2025-04-30	GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers	Xinyu Li et.al.	2504.21476	null
2025-04-30	SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments	Federico Nesti et.al.	2504.21454	null
2025-04-29	YoChameleon: Personalized Vision and Language Generation	Thao Nguyen et.al.	2504.20998	null
2025-04-29	TesserAct: Learning 4D Embodied World Models	Haoyu Zhen et.al.	2504.20995	null
2025-04-29	Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models	Tyler McDonald et.al.	2504.20946	null
2025-04-30	End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation	Andrea Di Pierno et.al.	2504.20923	link
2025-04-29	Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking	Dayananda Herurkar et.al.	2504.20900	null
2025-04-29	The Leaderboard Illusion	Shivalika Singh et.al.	2504.20879	null
2025-04-29	AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection	Lorenzo Pellegrini et.al.	2504.20865	null
2025-04-29	Universal language model with the intervention of quantum theory	D. -F. Qin et.al.	2504.20839	null
2025-04-29	SoccerDiffusion: Toward Learning End-to-End Humanoid Robot Soccer from Gameplay Recordings	Florian Vahl et.al.	2504.20808	null
2025-04-29	JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation	Ji Shi et.al.	2504.20770	null
2025-04-29	DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs	Hao Luan et.al.	2504.20754	null
2025-04-29	Learning a General Model: Folding Clothing with Topological Dynamics	Yiming Liu et.al.	2504.20720	null
2025-04-29	What's Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models	Jan Kapar et.al.	2504.20687	link
2025-04-29	DiffLiB: High-fidelity differentiable modeling of lithium-ion batteries and efficient gradient-based parameter identification	Weipeng Xu et.al.	2504.20674	link
2025-04-29	LDPoly: Latent Diffusion for Polygonal Road Outline Extraction in Large-Scale Topographic Mapping	Weiqin Jiao et.al.	2504.20645	null
2025-04-28	Shopformer: Transformer-Based Framework for Detecting Shoplifting via Human Pose	Narges Rashvand et.al.	2504.19970	null
2025-04-28	Warm-Starting QAOA with XY Mixers: A Novel Approach for Quantum-Enhanced Vehicle Routing Optimization	Rafael S. do Carmo et.al.	2504.19934	null
2025-04-28	CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition	Quynh Phung et.al.	2504.19894	null
2025-04-28	Queue or lounge: strategic design for strategic customer	Riya Sultana et.al.	2504.19889	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	link
2025-04-28	CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback	Chenhan Jiang et.al.	2504.19860	null
2025-04-28	Automated Generation of Precedence Graphs in Digital Value Chains for Automotive Production	Cornelius Hake et.al.	2504.19835	null
2025-04-28	Contextures: The Mechanism of Representation Learning	Runtian Zhai et.al.	2504.19792	null
2025-04-28	Heterophily-informed Message Passing	Haishan Wang et.al.	2504.19785	null
2025-04-28	Crafting a Personal Journaling Practice: Negotiating Ecosystems of Materials, Personal Context, and Community in Analog Journaling	Katherine Lin et.al.	2504.19767	null
2025-04-28	Lossy Beyond Diagonal Reconfigurable Intelligent Surfaces: Modeling and Optimization	Yiyang Peng et.al.	2504.19744	null
2025-04-28	RepText: Rendering Visual Text via Replicating	Haofan Wang et.al.	2504.19724	null
2025-04-28	SAGE: A Generic Framework for LLM Safety Evaluation	Madhur Jindal et.al.	2504.19674	link
2025-04-28	Multimodal Conditioned Diffusive Time Series Forecasting	Chen Su et.al.	2504.19669	null
2025-04-28	Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs	Muhammad Sabih et.al.	2504.19659	null
2025-04-25	Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation	Shivam Duggal et.al.	2504.18509	null
2025-04-25	Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional	Sanjeev Raja et.al.	2504.18506	null
2025-04-25	LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning	Rui Li et.al.	2504.18424	null
2025-04-25	HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models	Jens Hooge et.al.	2504.18405	null
2025-04-25	Paradigm shift on Coding Productivity Using GenAI	Liang Yu et.al.	2504.18404	null
2025-04-25	The Foundation for Developing an Exoskeleton for the Rehabilitation of Temporomandibular Disorders	Paul-Otto Müller et.al.	2504.18379	link
2025-04-25	Enhanced Sampling, Public Dataset and Generative Model for Drug-Protein Dissociation Dynamics	Maodong Li et.al.	2504.18367	null
2025-04-25	SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations	Shuting Zhao et.al.	2504.18332	null
2025-04-25	STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting	Yunze Deng et.al.	2504.18318	null
2025-04-25	Seeing Soundscapes: Audio-Visual Generation and Separation from Soundscapes Using Audio-Visual Separator	Minjae Kang et.al.	2504.18283	null
2025-04-25	TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation	Shintaro Ozaki et.al.	2504.18269	null
2025-04-25	Efficient Single-Pass Training for Multi-Turn Reasoning	Ritesh Goru et.al.	2504.18246	null
2025-04-25	Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding	Kun Li et.al.	2504.18204	null
2025-04-25	Generative AI for Physical-Layer Authentication	Rui Meng et.al.	2504.18175	null
2025-04-25	Offline Learning of Controllable Diverse Behaviors	Mathieu Petitbois et.al.	2504.18160	null
2025-04-24	LiDPM: Rethinking Point Diffusion for Lidar Scene Completion	Tetiana Martyniuk et.al.	2504.17791	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null
2025-04-24	WI2easy: warm inflation dynamics made easy	Gabriel S. Rodrigues et.al.	2504.17760	null
2025-04-24	User Profiles: The Achilles' Heel of Web Browsers	Dolière Francis Somé et.al.	2504.17692	null
2025-04-24	DiMeR: Disentangled Mesh Reconstruction Model	Lutao Jiang et.al.	2504.17670	null
2025-04-24	polyGen: A Learning Framework for Atomic-level Polymer Structure Generation	Ayush Jain et.al.	2504.17656	null
2025-04-24	Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization	Abderrachid Hamrani et.al.	2504.17628	null
2025-04-24	Likelihood-Free Variational Autoencoders	Chen Xu et.al.	2504.17622	null
2025-04-24	Enhancing CNNs robustness to occlusions with bioinspired filters for border completion	Catarina P. Coutinho et.al.	2504.17619	null
2025-04-24	Mitigating xApp conflicts for efficient network slicing in 6G O-RAN: a graph convolutional-based attention network approach	Sihem Bakri et.al.	2504.17590	null
2025-04-24	TileLang: A Composable Tiled Programming Model for AI Systems	Lei Wang et.al.	2504.17577	null
2025-04-24	ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting	Junyan Zhang et.al.	2504.17524	null
2025-04-24	Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks	Zhiying Li et.al.	2504.17457	null
2025-04-24	3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models	Min Wei et.al.	2504.17414	null
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	null
2025-04-23	Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light	Ali Hassani et.al.	2504.16922	null
2025-04-23	DreamO: A Unified Framework for Image Customization	Chong Mou et.al.	2504.16915	null
2025-04-23	BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation	Ruotong Wang et.al.	2504.16907	null
2025-04-23	Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials	Peichen Zhong et.al.	2504.16893	null
2025-04-23	Situational Preparedness Dynamics for Sequential Tropical Cyclone Hazards	Tianle Duan et.al.	2504.16878	null
2025-04-23	Planning with Diffusion Models for Target-Oriented Dialogue Systems	Hanwen Du et.al.	2504.16858	null
2025-04-23	Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models	Ilyass Taouil et.al.	2504.16843	null
2025-04-23	Snorkeling in dark waters: A longitudinal surface exploration of unique Tor Hidden Services (Extended Version)	Alfonso Rodriguez Barredo-Valenzuela et.al.	2504.16836	null
2025-04-23	Evaluating Autoencoders for Parametric and Invertible Multidimensional Projections	Frederik L. Dennig et.al.	2504.16831	null
2025-04-23	Advanced Chest X-Ray Analysis via Transformer-Based Image Descriptors and Cross-Model Attention Mechanism	Lakshita Agarwal et.al.	2504.16774	null
2025-04-23	How Effective are Generative Large Language Models in Performing Requirements Classification?	Waad Alhoshan et.al.	2504.16768	null
2025-04-23	Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism	Lakshita Agarwal et.al.	2504.16761	null
2025-04-23	Feature Mixing Approach for Detecting Intraoperative Adverse Events in Laparoscopic Roux-en-Y Gastric Bypass Surgery	Rupak Bose et.al.	2504.16749	null
2025-04-24	Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks	Yanan Zhao et.al.	2504.16748	null
2025-04-23	MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning	Itamar Mishani et.al.	2504.16738	null
2025-04-22	Survey of Video Diffusion Models: Foundations, Implementations, and Applications	Yimu Wang et.al.	2504.16081	link
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	null
2025-04-22	Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation	Yuanpeng Qu et.al.	2504.16077	null
2025-04-22	High-performance training and inference for deep equivariant interatomic potentials	Chuin Wei Tan et.al.	2504.16068	null
2025-04-22	Boosting Generative Image Modeling via Joint Image-Feature Synthesis	Theodoros Kouzelis et.al.	2504.16064	null
2025-04-22	Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis	Frank Li et.al.	2504.16047	null
2025-04-22	Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework	Xinyuan Song et.al.	2504.16016	null
2025-04-22	Deep learning of point processes for modeling high-frequency data	Yoshihiro Gyotoku et.al.	2504.15944	null
2025-04-22	Adversarial Observations in Weather Forecasting	Erik Imgrund et.al.	2504.15942	null
2025-04-22	Text-based Animatable 3D Avatars with Morphable Model Alignment	Yiqian Wu et.al.	2504.15835	null
2025-04-22	Satellite to GroundScape -- Large-scale Consistent Ground View Generation from Satellite Views	Ningli Xu et.al.	2504.15786	null
2025-04-22	Clifford Group Equivariant Diffusion Models for 3D Molecular Generation	Cong Liu et.al.	2504.15773	null
2025-04-22	Stochastic Programming for Dynamic Temperature Control of Refrigerated Road Transport	Francesco Giliberto et.al.	2504.15741	null
2025-04-22	Riemannian Neural Geodesic Interpolant	Jiawen Wu et.al.	2504.15736	null
2025-04-22	Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models	Dasol Jeong et.al.	2504.15723	null
2025-04-21	Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction	Vaishnavh Nagarajan et.al.	2504.15266	link
2025-04-21	Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation	Yunxuan Cai et.al.	2504.15259	null
2025-04-21	Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators	Yilun Zhou et.al.	2504.15253	link
2025-04-21	DRAGON: Distributional Rewards Optimize Diffusion Generative Models	Yatong Bai et.al.	2504.15217	null
2025-04-21	Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs	Marina Sakharova et.al.	2504.15210	null
2025-04-21	Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform	Xianpan Zhou et.al.	2504.15182	null
2025-04-21	FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image	Fei Yin et.al.	2504.15179	null
2025-04-21	DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution	Miaomiao Cai et.al.	2504.15176	null
2025-04-21	Automatic Generation of Aerobatic Flight in Complex Environments via Diffusion Models	Yuhang Zhong et.al.	2504.15138	null
2025-04-21	Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations	Csongor Csanad Kariko et.al.	2504.15121	null
2025-04-22	VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation	Mingxia Zhan et.al.	2504.15095	null
2025-04-21	Generative Artificial Intelligence for Beamforming in Low-Altitude Economy	Geng Sun et.al.	2504.15079	null
2025-04-21	SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation	Yue Li et.al.	2504.15035	null
2025-04-21	Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models	Zijin Yang et.al.	2504.15026	null
2025-04-21	PIV-FlowDiffuser: Transfer-learning-based denoising diffusion models for PIV	Qianyu Zhu et.al.	2504.14952	link
2025-04-18	Decoding Vision Transformers: the Diffusion Steering Lens	Ryota Takatsuki et.al.	2504.13763	link
2025-04-18	ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis	Andrea Rigo et.al.	2504.13745	null
2025-04-18	MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection	Lin Yuan et.al.	2504.13726	null
2025-04-18	Magnecko: Design and Control of a Quadrupedal Magnetic Climbing Robot	Stefan Leuthard et.al.	2504.13672	null
2025-04-18	Word Embedding Techniques for Classification of Star Ratings	Hesham Abdelmotaleb et.al.	2504.13653	null
2025-04-18	Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning	Tao He et.al.	2504.13643	null
2025-04-18	SupResDiffGAN a new approach for the Super-Resolution task	Dawid Kopeć et.al.	2504.13622	null
2025-04-18	Entropic Time Schedulers for Generative Diffusion Models	Dejan Stancevic et.al.	2504.13612	null
2025-04-18	WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion	Yang Wu et.al.	2504.13561	link
2025-04-18	Task Assignment and Exploration Optimization for Low Altitude UAV Rescue via Generative AI Enhanced Multi-agent Reinforcement Learning	Xin Tang et.al.	2504.13554	null
2025-04-18	Beyond One-Hot Labels: Semantic Mixing for Model Calibration	Haoyang Luo et.al.	2504.13548	link
2025-04-18	Enhancing Multilingual Sentiment Analysis with Explainability for Sinhala, English, and Code-Mixed Content	Azmarah Rizvi et.al.	2504.13545	null
2025-04-18	MusFlow: Multimodal Music Generation via Conditional Flow Matching	Jiahao Song et.al.	2504.13535	null
2025-04-18	U-Shape Mamba: State Space Model for faster diffusion	Alex Ergasti et.al.	2504.13499	link
2025-04-18	Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing	Joowon Kim et.al.	2504.13490	null
2025-04-17	Aligning Constraint Generation with Design Intent in Parametric CAD	Evan Casey et.al.	2504.13178	null
2025-04-17	SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs	Haoxuan Li et.al.	2504.13172	null
2025-04-17	Personalized Text-to-Image Generation with Auto-Regressive Models	Kaiyue Sun et.al.	2504.13162	link
2025-04-17	Science-T2I: Addressing Scientific Illusions in Image Synthesis	Jialuo Li et.al.	2504.13129	null
2025-04-17	UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models	Guanlong Jiao et.al.	2504.13109	null
2025-04-17	RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity	Ranjan Sapkota et.al.	2504.13099	null
2025-04-17	An All-Atom Generative Model for Designing Protein Complexes	Ruizhe Chen et.al.	2504.13075	null
2025-04-18	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-17	ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models	Linkang Du et.al.	2504.13061	link
2025-04-17	Design Topological Materials by Reinforcement Fine-Tuned Generative Model	Haosheng Xu et.al.	2504.13048	null
2025-04-17	Evidence for sulfur chemistry in the atmosphere of the warm sub-Neptune TOI-270 d	Lukas Felix et.al.	2504.13039	null
2025-04-17	TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution	Yide Liu et.al.	2504.13026	link
2025-04-17	GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration	Rendong Zhang et.al.	2504.12999	link
2025-04-17	QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?	Zhouyang Jiang et.al.	2504.12961	null
2025-04-17	Systemic risk mitigation in supply chains through network rewiring	Giacomo Zelbi et.al.	2504.12955	null
2025-04-16	VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate	Zhihang Yuan et.al.	2504.12259	link
2025-04-16	Cobra: Efficient Line Art COlorization with BRoAder References	Junhao Zhuang et.al.	2504.12240	null
2025-04-16	Coding-Prior Guided Diffusion Network for Video Deblurring	Yike Liu et.al.	2504.12222	null
2025-04-16	Validating and monitoring bibliographic and citation data in OpenCitations collections	Ivan Heibi et.al.	2504.12195	null
2025-04-16	Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging	Tristan S. W. Stevens et.al.	2504.12154	null
2025-04-16	Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis	Songping Wang et.al.	2504.12129	null
2025-04-16	A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction	Zhenyu Yu et.al.	2504.12112	null
2025-04-16	Generalized Visual Relation Detection with Diffusion Models	Kaifeng Gao et.al.	2504.12100	null
2025-04-16	Generative Deep Learning Framework for Inverse Design of Fuels	Kiran K. Yalamanchi et.al.	2504.12075	null
2025-04-16	Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM	Zirui Pan et.al.	2504.12048	null
2025-04-17	Understanding Attention Mechanism in Video Diffusion Models	Bingyan Liu et.al.	2504.12027	null
2025-04-16	Instruction-augmented Multimodal Alignment for Image-Text and Element Matching	Xinli Yue et.al.	2504.12018	null
2025-04-17	Dual-Energy Cone-Beam CT Using Two Orthogonal Projection Views: A Phantom Study	Junbo Peng et.al.	2504.12010	null
2025-04-16	Generative Recommendation with Continuous-Token Diffusion	Haohao Qu et.al.	2504.12007	null
2025-04-16	R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors	Haoyang Wang et.al.	2504.11946	null
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	link
2025-04-16	Elucidating the Design Space of Multimodal Protein Language Models	Cheng-Yen Hsieh et.al.	2504.11454	null
2025-04-16	Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion	An Zhao et.al.	2504.11447	link
2025-04-15	NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors	Yanrui Bin et.al.	2504.11427	null
2025-04-15	ADT: Tuning Diffusion Models with Adversarial Supervision	Dazhong Shen et.al.	2504.11423	null
2025-04-15	VideoPanda: Video Panoramic Diffusion with Multi-view Attention	Kevin Xie et.al.	2504.11389	null
2025-04-15	Ring Artifacts Correction Based on Global-Local Features Interaction Guidance in the Projection Domain	Yunze Liu et.al.	2504.11375	null
2025-04-15	Evaluating DAO Sustainability and Longevity Through On-Chain Governance Metrics	Silvio Meneguzzo et.al.	2504.11341	null
2025-04-15	Autoregressive Distillation of Diffusion Transformers	Yeongmin Kim et.al.	2504.11295	link
2025-04-15	DeepSelective: Feature Gating and Representation Matching for Interpretable Clinical Prediction	Ruochi Zhang et.al.	2504.11264	null
2025-04-15	VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers	Run Wang et.al.	2504.11227	null
2025-04-15	Focal Split: Untethered Snapshot Depth from Differential Defocus	Junjie Luo et.al.	2504.11202	null
2025-04-15	DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention	Haohan Chen et.al.	2504.11160	null
2025-04-15	SAR-to-RGB Translation with Latent Diffusion for Earth Observation	Kaan Aydin et.al.	2504.11154	null
2025-04-15	Taming Consistency Distillation for Accelerated Human Image Animation	Xiang Wang et.al.	2504.11143	null
2025-04-14	REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers	Xingjian Leng et.al.	2504.10483	null
2025-04-14	Online Advanced Labs in Physics	Peter A. Bennett et.al.	2504.10470	null
2025-04-14	Art3D: Training-Free 3D Generation from Flat-Colored Illustration	Xiaoyan Cong et.al.	2504.10466	null
2025-04-14	Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Taihang Hu et.al.	2504.10434	link
2025-04-14	MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model	Jian Liu et.al.	2504.10433	link
2025-04-14	AI-Driven Code Refactoring: Using Graph Neural Networks to Enhance Software Maintainability	Gopichand Bandarupalli et.al.	2504.10412	null
2025-04-14	LLM-driven Constrained Copy Generation through Iterative Refinement	Varun Vasudevan et.al.	2504.10391	null
2025-04-14	Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects	Lena Scholz et.al.	2504.10348	null
2025-04-14	α-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models	Chaoran Cheng et.al.	2504.10283	null
2025-04-14	DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing	Jinyue Zhang et.al.	2504.10278	null
2025-04-14	When Technologies Are Not Enough: Understanding How Domestic Workers Employ (and Avoid) Online Technologies in Their Work Practices	Mariana Fernandez-Espinosa et.al.	2504.10265	null
2025-04-14	A Model Zoo of Vision Transformers	Damian Falk et.al.	2504.10231	link
2025-04-14	Localized Cultural Knowledge is Conserved and Controllable in Large Language Models	Veniamin Veselovsky et.al.	2504.10191	null
2025-04-14	Efficient Generative Model Training via Embedded Representation Warmup	Deyuan Liu et.al.	2504.10188	link
2025-04-14	A New Paradigm in IBR Modeling for Power Flow and Short Circuit Analysis	Zahid Javid et.al.	2504.10181	null
2025-04-11	Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Team Seawead et.al.	2504.08685	null
2025-04-11	Safe Flow Matching: Robot Motion Planning with Control Barrier Functions	Xiaobing Dai et.al.	2504.08661	null
2025-04-11	Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization	Jialu Li et.al.	2504.08641	null
2025-04-11	Quantum Fluctuation-enhanced Milli-Kelvin Magnetic Refrigeration in Triangular Lattice Magnet GdBO3	Weijie Lin et.al.	2504.08636	null
2025-04-11	Discretization Error Analysis of a High Order Unfitted Space-Time Method for moving domain problems	Fabian Heimann et.al.	2504.08608	null
2025-04-11	Neural Fidelity Calibration for Informative Sim-to-Real Adaptation	Youwei Yu et.al.	2504.08604	null
2025-04-11	ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration	Yongsheng Yu et.al.	2504.08591	null
2025-04-11	COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails	Miguel Espinosa et.al.	2504.08548	null
2025-04-11	Slicing the Gaussian Mixture Wasserstein Distance	Moritz Piening et.al.	2504.08544	link
2025-04-11	Discriminator-Free Direct Preference Optimization for Video Diffusion	Haoran Cheng et.al.	2504.08542	null
2025-04-11	On The Landscape of Spoken Language Models: A Comprehensive Survey	Siddhant Arora et.al.	2504.08528	null
2025-04-11	TickIt: Leveraging Large Language Models for Automated Ticket Escalation	Fengrui Liu et.al.	2504.08475	null
2025-04-11	Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation	Bram Vanherle et.al.	2504.08473	link
2025-04-11	On the Design of Diffusion-based Neural Speech Codecs	Pietro Foti et.al.	2504.08470	null
2025-04-11	Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion	Weiye Chen et.al.	2504.08451	link
2025-04-10	PixelFlow: Pixel-Space Generative Models with Flow	Shoufa Chen et.al.	2504.07963	link
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-10	VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning	Zhong-Yu Li et.al.	2504.07960	null
2025-04-10	Activating high-power parametric oscillation in photonic-crystal resonators	Grant M. Brodnik et.al.	2504.07947	null
2025-04-10	GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces	Hao Yu et.al.	2504.07945	null
2025-04-10	Echo: An Open-Source, Low-Cost Teleoperation System with Force Feedback for Dataset Collection in Robot Learning	Artem Bazhenov et.al.	2504.07939	null
2025-04-10	Optimal Control For Anti-Abeta Treatment in Alzheimer's Disease using a Reaction-Diffusion Model	Wenrui Hao et.al.	2504.07913	null
2025-04-10	DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows	Mashrur M. Morshed et.al.	2504.07894	null
2025-04-10	QubitHammer Attacks: Qubit Flipping Attacks in Multi-tenant Superconducting Quantum Computers	Yizhuo Tan et.al.	2504.07875	null
2025-04-11	Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs	Yichun Yin et.al.	2504.07866	null
2025-04-10	A Review of HPC-Accelerated CFD in National Security and Defense	James Afful et.al.	2504.07837	null
2025-04-10	The ISC Creator: Human-Centered Design of Learning Analytics Interactive Indicator Specification Cards	Shoeb Joarder et.al.	2504.07811	null
2025-04-10	Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations	Yifan Ding et.al.	2504.07793	link
2025-04-10	Characterization of the Electronic Noise in the Readout of Resistive Micromegas in the High-Angle Time Projection Chambers of the T2K Experiment	D. Attié et.al.	2504.07759	null
2025-04-10	Virtual-mask Informed Prior for Sparse-view Dual-Energy CT Reconstruction	Zini Chen et.al.	2504.07753	null
2025-04-09	Identifying Unknown Stochastic Dynamics via Finite expression methods	Senwei Liang et.al.	2504.07085	null
2025-04-09	Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety	Chad Melton et.al.	2504.07022	null
2025-04-09	Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies	Jonas Loos et.al.	2504.07008	link
2025-04-09	A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology	Marco Acerbis et.al.	2504.06957	link
2025-04-09	PathSegDiff: Pathology Segmentation using Diffusion model representations	Sachin Kumar Danisetty et.al.	2504.06950	null
2025-04-09	The Importance of Being Discrete: Measuring the Impact of Discretization in End-to-End Differentially Private Synthetic Data	Georgi Ganev et.al.	2504.06923	null
2025-04-09	Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains	Ming Liu et.al.	2504.06917	null
2025-04-09	MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs	Jiawei Mao et.al.	2504.06897	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	null
2025-04-09	CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading	Mishan Aliev et.al.	2504.06856	null
2025-04-09	Open Problems and a Hypothetical Path Forward in LLM Knowledge Paradigms	Xiaotian Ye et.al.	2504.06823	null
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	link
2025-04-09	A Meaningful Perturbation Metric for Evaluating Explainability Methods	Danielle Cohen et.al.	2504.06800	null
2025-04-09	FedMerge: Federated Personalization via Model Merging	Shutong Chen et.al.	2504.06768	null
2025-04-09	DIMA: DIffusing Motion Artifacts for unsupervised correction in brain MRI images	Paolo Angella et.al.	2504.06767	null
2025-04-08	OmniSVG: A Unified Scalable Vector Graphics Generation Model	Yiying Yang et.al.	2504.06263	null
2025-04-08	Transfer between Modalities with MetaQueries	Xichen Pan et.al.	2504.06256	null
2025-04-08	Electronic Structure Guided Inverse Design Using Generative Models	Shuyi Jia et.al.	2504.06249	link
2025-04-08	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models	Chejian Xu et.al.	2504.06214	null
2025-04-08	WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care	Vanessa Borst et.al.	2504.06185	null
2025-04-08	Deploying Chatbots in Customer Service: Adoption Hurdles and Simple Remedies	Evgeny Kagan et.al.	2504.06145	null
2025-04-08	QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform	Movina Moses et.al.	2504.06136	null
2025-04-08	FaceCloak: Learning to Protect Face Templates	Sudipta Banerjee et.al.	2504.06131	null
2025-04-08	OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model	Xiaochen Wei et.al.	2504.06027	null
2025-04-08	CamContextI2V: Context-aware Controllable Video Generation	Luis Denninger et.al.	2504.06022	link
2025-04-08	Note on the Universality of Parameterized IQP Circuits with Hidden Units for Generating Probability Distributions	Andrii Kurkin et.al.	2504.05997	null
2025-04-08	An Empirical Study of GPT-4o Image Generation Capabilities	Sixiang Chen et.al.	2504.05979	link
2025-04-08	Diffusion Based Ambiguous Image Segmentation	Jakob Lønborg Christensen et.al.	2504.05977	null
2025-04-08	Adaptive Extended Kalman Filtering for Battery State of Charge Estimation on STM32	António Barros et.al.	2504.05936	null
2025-04-08	Pushing JWST to the extremes: search and scrutiny of bright galaxy candidates at z ≃ 15-30	M. Castellano et.al.	2504.05893	null
2025-04-07	CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models	Kavana Venkatesh et.al.	2504.05306	null
2025-04-07	Gaussian Mixture Flow Matching Models	Hansheng Chen et.al.	2504.05304	link
2025-04-07	Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures	Gen Li et.al.	2504.05300	null
2025-04-07	Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Hengran Zhang et.al.	2504.05216	null
2025-04-07	P2Mark: Plug-and-play Parameter-intrinsic Watermarking for Neural Speech Generation	Yong Ren et.al.	2504.05197	null
2025-04-07	Learning symmetries in datasets	Veronica Sanz et.al.	2504.05174	null
2025-04-07	DDPM Score Matching and Distribution Learning	Sinho Chewi et.al.	2504.05161	null
2025-04-07	DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration	Jiamei Xiong et.al.	2504.05135	null
2025-04-07	Graph-based Diffusion Model for Collaborative Filtering	Xuan Zhang et.al.	2504.05029	null
2025-04-07	RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model	Congcong Wen et.al.	2504.04988	null
2025-04-07	Low-Rate Semantic Communication with Codebook-based Conditional Generative Models	Kailang Ye et.al.	2504.04977	null
2025-04-08	REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning	Jihyun Lee et.al.	2504.04956	null
2025-04-07	A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization	Wenyuan Xu et.al.	2504.04950	null
2025-04-07	One Quantizer is Enough: Toward a Lightweight Audio Codec	Linwei Zhai et.al.	2504.04949	link
2025-04-07	Video-Bench: Human-Aligned Video Generation Benchmark	Hui Han et.al.	2504.04907	null
2025-04-04	MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models	Wulin Xie et.al.	2504.03641	null
2025-04-04	Enhancing Causal Effect Estimation with Diffusion-Generated Data	Li Chen et.al.	2504.03630	null
2025-04-04	Quantifying the uncertainty of model-based synthetic image quality metrics	Ciaran Bench et.al.	2504.03623	null
2025-04-04	VISTA-OCR: Towards generative and interactive end to end OCR models	Laziz Hamdi et.al.	2504.03621	null
2025-04-04	Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution	Aref Azizpour et.al.	2504.03615	null
2025-04-04	Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal	Yuyang Hu et.al.	2504.03607	null
2025-04-04	HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration	Boyuan Wang et.al.	2504.03536	null
2025-04-04	Diffusion Active Learning: Towards Data-Driven Experimental Design in Computed Tomography	Luis Barba et.al.	2504.03491	null
2025-04-04	BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution	Zihao He et.al.	2504.03490	null
2025-04-04	Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej	Shubham Kumar Nigam et.al.	2504.03486	null
2025-04-04	Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis	Xi Wang et.al.	2504.03471	link
2025-04-04	D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations	Antoine Dumoulin et.al.	2504.03468	null
2025-04-04	Generating ensembles of spatially-coherent in-situ forecasts using flow matching	David Landry et.al.	2504.03463	null
2025-04-04	Conditioning Diffusions Using Malliavin Calculus	Jakiw Pidstrigach et.al.	2504.03461	null
2025-04-04	QuinID: Enabling FDMA-Based Fully Parallel RFID with Frequency-Selective Antenna	Xin Na et.al.	2504.03412	link
2025-04-03	Concept Lancet: Image Editing with Compositional Representation Transplant	Jinqi Luo et.al.	2504.02828	null
2025-04-03	Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization	Kangle Deng et.al.	2504.02817	null
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	link
2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null
2025-04-03	MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection	Ahmet Burak Yildirim et.al.	2504.02762	null
2025-04-03	Echoes of the hidden: Uncovering coordination beyond network structure	Shahar Somin et.al.	2504.02757	null
2025-04-04	RBT4DNN: Requirements-based Testing of Neural Networks	Nusrat Jahan Mozumder et.al.	2504.02737	link
2025-04-03	Pushing the Limit of PPG Sensing in Sedentary Conditions by Addressing Poor Skin-sensor Contact	Manh Pham Hung et.al.	2504.02735	null
2025-04-03	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models	ZhongLi Fang et.al.	2504.02640	null
2025-04-03	Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge	Dong-Sig Han et.al.	2504.02618	null
2025-04-03	Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation	Jiwoo Chung et.al.	2504.02612	null
2025-04-03	Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression	Lucas Relic et.al.	2504.02579	null
2025-04-03	MAD: Makeup All-in-One with Cross-Domain Diffusion Model	Bo-Kai Ruan et.al.	2504.02545	null
2025-04-03	High Numerical Aperture Achromatic Meta-Devices through Dispersion Compensation	Yuzhong Wang et.al.	2504.02535	null
2025-04-04	ARCANE: Adaptive RISC-V Cache Architecture for Near-memory Extensions	Vincenzo Petrolo et.al.	2504.02533	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-03	VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step	Hanyang Wang et.al.	2504.01956	null
2025-04-02	A Unified Approach to Analysis and Design of Denoising Markov Models	Yinuo Ren et.al.	2504.01938	null
2025-04-03	ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement	Runhui Huang et.al.	2504.01934	null
2025-04-02	Gen-C: Populating Virtual Worlds with Generative Crowds	Andreas Panayiotou et.al.	2504.01924	null
2025-04-03	Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation	Baban Gain et.al.	2504.01919	null
2025-04-02	Multi-fidelity Parameter Estimation Using Conditional Diffusion Models	Caroline Tatsuoka et.al.	2504.01894	null
2025-04-02	A Diffusion-Based Framework for Occluded Object Movement	Zheng-Peng Duan et.al.	2504.01873	null
2025-04-02	Interpreting Emergent Planning in Model-Free Reinforcement Learning	Thomas Bush et.al.	2504.01871	null
2025-04-02	BOGausS: Better Optimized Gaussian Splatting	Stéphane Pateux et.al.	2504.01844	null
2025-04-02	YourBench: Easy Custom Evaluation Sets for Everyone	Sumuk Shashidhar et.al.	2504.01833	link
2025-04-02	Implicit Bias Injection Attacks against Text-to-Image Diffusion Models	Huayang Huang et.al.	2504.01819	link
2025-04-02	DISINFOX: an open-source threat exchange platform serving intelligence on disinformation and influence operations	Felipe Sánchez González et.al.	2504.01803	null
2025-04-02	The protein escape process at the ribosomal exit tunnel has conserved mechanisms across the domains of life	Phuong Thuy Bui et.al.	2504.01731	null
2025-04-02	An Adaptive Proximal Inexact Gradient Framework and Its Application to Per-Antenna Constrained Joint Beamforming and Compression Design	Xilai Fan et.al.	2504.01721	null
2025-03-31	Consistent Subject Generation via Contrastive Instantiated Concepts	Lee Hsin-Ying et.al.	2503.24387	null
2025-03-31	Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation	Shengqiong Wu et.al.	2503.24379	null
2025-03-31	InstructRestore: Region-Customized Image Restoration with Human Instructions	Shuaizheng Liu et.al.	2503.24357	link
2025-03-31	Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach	Francesco Pio Ramunno et.al.	2503.24271	link
2025-04-01	Visual Acoustic Fields	Yuelei Li et.al.	2503.24270	null
2025-03-31	Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes	Daichi Otsuka et.al.	2503.24229	null
2025-03-31	AI-Assisted Colonoscopy: Polyp Detection and Segmentation using Foundation Models	Uxue Delaquintana-Aramendi et.al.	2503.24138	link
2025-03-31	Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied Cognition	François Olivier et.al.	2503.24110	null
2025-03-31	Controlled Latent Diffusion Models for 3D Porous Media Reconstruction	Danilo Naiff et.al.	2503.24083	link
2025-03-31	COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation	Siqi Zhang et.al.	2503.24065	null
2025-03-31	ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance	Tong Xie et.al.	2503.24053	link
2025-03-31	Automated Discovery of Tactic Libraries for Interactive Theorem Proving	Yutong Xin et.al.	2503.24036	null
2025-03-31	DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model	Ming Yuan et.al.	2503.23993	null
2025-03-31	Two-wheel-driven Electric Superbike Powertrain Optimization	Adelmo Niccolai et.al.	2503.23984	null
2025-04-02	Machine Learning-assisted High-speed Combinatorial Optimization with Ising Machines for Dynamically Changing Problems	Yohei Hamakawa et.al.	2503.23966	null
2025-03-28	DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness	Ruining Li et.al.	2503.22677	null
2025-03-28	Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model	Jangho Park et.al.	2503.22622	null
2025-03-28	Generative Latent Neural PDE Solver using Flow Matching	Zijie Li et.al.	2503.22600	null
2025-03-28	RELD: Regularization by Latent Diffusion Models for Image Restoration	Pasquale Cascarano et.al.	2503.22563	null
2025-03-28	Deterministic Medical Image Translation via High-fidelity Brownian Bridges	Qisheng He et.al.	2503.22531	null
2025-03-28	Automated UX Insights from User Research Videos by Integrating Facial Emotion and Text Sentiment	Simran Kaur Ghatoray et.al.	2503.22510	null
2025-03-28	Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments	Luke Rowe et.al.	2503.22496	null
2025-03-28	GAITGen: Disentangled Motion-Pathology Impaired Gait Generative Model -- Bringing Motion Generation to the Clinical Domain	Vida Adeli et.al.	2503.22397	null
2025-03-28	Volumetric Material Decomposition Using Spectral Diffusion Posterior Sampling with a Compressed Polychromatic Forward Model	Xiao Jiang et.al.	2503.22392	null
2025-03-28	Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization	Barış Batuhan Topal et.al.	2503.22352	null
2025-03-28	GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion	Li-Heng Chen et.al.	2503.22349	null
2025-03-28	Semantix: An Energy Guided Sampler for Semantic Style Transfer	Huiang He et.al.	2503.22344	null
2025-03-28	SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection	Shrikant Malviya et.al.	2503.22338	link
2025-03-28	Imperceptible but Forgeable: Practical Invisible Watermark Forgery via Diffusion Models	Ziping Dong et.al.	2503.22330	null
2025-03-28	BanglAssist: A Bengali-English Generative AI Chatbot for Code-Switching and Dialect-Handling in Customer Service	Francesco Kruk et.al.	2503.22283	null
2025-03-27	VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models	Chi-Pin Huang et.al.	2503.21781	null
2025-03-27	StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion	Ziyu Guo et.al.	2503.21775	null
2025-03-27	Optimal Stepsize for Diffusion Sampling	Jianning Pei et.al.	2503.21774	link
2025-03-27	A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Hongkai Lin et.al.	2503.21771	link
2025-03-27	Exploring the Evolution of Physics Cognition in Video Generation: A Survey	Minghui Lin et.al.	2503.21765	link
2025-03-27	A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One	Minyoung Kim et.al.	2503.21756	null
2025-03-27	VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness	Dian Zheng et.al.	2503.21755	link
2025-03-27	3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models	Yuhan Zhang et.al.	2503.21745	null
2025-03-27	Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Zhiyuan Ma et.al.	2503.21694	link
2025-03-27	A Comprehensive Benchmark for RNA 3D Structure-Function Modeling	Luis Wyss et.al.	2503.21681	link
2025-03-27	A friendly introduction to triangular transport	Maximilian Ramgraber et.al.	2503.21673	null
2025-03-27	Audio-driven Gesture Generation via Deviation Feature in the Latent Space	Jiahui Chen et.al.	2503.21616	null
2025-03-27	Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs	Yoann Boget et.al.	2503.21592	null
2025-03-27	AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion	Liuyue Xie et.al.	2503.21581	null
2025-03-27	SyncSDE: A Probabilistic Framework for Diffusion Synchronization	Hyunjun Lee et.al.	2503.21555	null
2025-03-26	Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency	Tianqi Liu et.al.	2503.20785	link
2025-03-26	FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks	Jinwei Li et.al.	2503.20784	link
2025-03-26	PUREPath-B: A Tessellated Bayesian Model for Recovering CMB B-modes over Large Angular Scales of the Sky	Vipin Sudevan et.al.	2503.20774	null
2025-03-26	Reliable algorithm selection for machine learning-guided design	Clara Fannjiang et.al.	2503.20767	null
2025-03-26	RecTable: Fast Modeling Tabular Data with Rectified Flow	Masane Fuchi et.al.	2503.20731	link
2025-03-26	Continual learning via probabilistic exchangeable sequence modelling	Hanwen Xing et.al.	2503.20725	null
2025-03-26	Dynamic Motion Blending for Versatile Motion Editing	Nan Jiang et.al.	2503.20724	null
2025-03-26	From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models	Nikita Neveditsin et.al.	2503.20715	null
2025-03-26	Flow of a two-dimensional liquid foam: Impact of surfactant type and boundary conditions	Farshad Nazari et.al.	2503.20710	null
2025-03-26	BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation	Yuyang Peng et.al.	2503.20672	null
2025-03-26	ARMO: Autoregressive Rigging for Multi-Category Objects	Mingze Sun et.al.	2503.20663	null
2025-03-26	MMGen: Unified Multi-modal Image Generation and Understanding in One Go	Jiepeng Wang et.al.	2503.20644	null
2025-03-26	Diffusion Counterfactuals for Image Regressors	Trung Duc Ha et.al.	2503.20595	link
2025-03-26	Supply chain network rewiring dynamics at the firm-level	Tobias Reisch et.al.	2503.20594	link
2025-03-26	Stochastic Transport Maps in Diffusion Models and Sampling	Xicheng Zhang et.al.	2503.20573	null
2025-03-25	Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models	Sangwon Beak et.al.	2503.19914	null
2025-03-25	PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model	Mingju Gao et.al.	2503.19913	null
2025-03-26	AvatarArtist: Open-Domain 4D Avatarization	Hongyu Liu et.al.	2503.19906	null
2025-03-25	ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models	Fernando Julio Cendra et.al.	2503.19902	null
2025-03-25	Scaling Down Text Encoders of Text-to-Image Diffusion Models	Lifu Wang et.al.	2503.19897	link
2025-03-25	Visuo-Tactile Object Pose Estimation for a Multi-Finger Robot Hand with Low-Resolution In-Hand Tactile Sensing	Lukas Mack et.al.	2503.19893	null
2025-03-25	FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model	Jun Zhou et.al.	2503.19839	null
2025-03-25	TopoGEN: topology-driven microstructure generation for in silico modeling of fiber network mechanics	Sara Cardona et.al.	2503.19832	null
2025-03-25	IgCraft: A versatile sequence generation framework for antibody discovery and engineering	Matthew Greenig et.al.	2503.19821	link
2025-03-25	Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models	Ruixi You et.al.	2503.19798	null
2025-03-26	In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush	Vitaly Gnatyuk et.al.	2503.19793	null
2025-03-25	SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation	Jingdan Kang et.al.	2503.19791	link
2025-03-25	Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models	Kartik Thakral et.al.	2503.19783	null
2025-03-25	PCM: Picard Consistency Model for Fast Parallel Sampling of Diffusion Models	Junhyuk So et.al.	2503.19731	null
2025-03-25	CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation	Rupak Bose et.al.	2503.19661	null
2025-03-24	Target-Aware Video Diffusion Models	Taeksoo Kim et.al.	2503.18950	null
2025-03-24	Equivariant Image Modeling	Ruixiao Dong et.al.	2503.18948	link
2025-03-25	Aether: Geometric-Aware Unified World Modeling	Aether Team et.al.	2503.18945	null
2025-03-24	Video-T1: Test-Time Scaling for Video Generation	Fangfu Liu et.al.	2503.18942	null
2025-03-24	Training-free Diffusion Acceleration with Bottleneck Sampling	Ye Tian et.al.	2503.18940	null
2025-03-24	SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction	Enrico Pallotta et.al.	2503.18933	link
2025-03-24	Entanglement swapping systems toward a quantum internet	Samantha I. Davis et.al.	2503.18906	null
2025-03-24	3DSwapping: Texture Swapping For 3D Object From Single Reference Image	Xiao Cao et.al.	2503.18853	null
2025-03-24	Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction	Yuxuan Zhang et.al.	2503.18836	null
2025-03-24	Blind structured illumination microscopy via generalized Richardson-Lucy method	Valentina Capalbo et.al.	2503.18786	null
2025-03-24	Duality Symmetry in Causality Constraints for Enhanced Acoustic Absorption	Sichao Qu et.al.	2503.18740	null
2025-03-24	RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation	Chengbo Yuan et.al.	2503.18738	null
2025-03-24	Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos	Chris Pedersen et.al.	2503.18731	null
2025-03-24	NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping	Tianyi Wang et.al.	2503.18678	null
2025-03-24	Human Motion Unlearning	Edoardo De Matteis et.al.	2503.18674	null
2025-03-21	Position: Interactive Generative Video as Next-Generation Game Engine	Jiwen Yu et.al.	2503.17359	null
2025-03-21	Predicting Potential Customer Support Needs and Optimizing Search Ranking in a Two-Sided Marketplace	Do-kyum Kim et.al.	2503.17329	null
2025-03-21	Preference-Guided Diffusion for Multi-Objective Offline Optimization	Yashas Annadani et.al.	2503.17299	null
2025-03-21	Cross-Band Modulation Design for Hybrid RF-Optical Systems	Thrassos K. Oikonomou et.al.	2503.17296	null
2025-03-21	Offline Model-Based Optimization: Comprehensive Review	Minsu Kim et.al.	2503.17286	link
2025-03-21	Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras	Shuang Guo et.al.	2503.17262	link
2025-03-21	Deep End-to-End Posterior ENergy (DEEPEN) for image recovery	Jyothi Rikhab Chand et.al.	2503.17244	null
2025-03-21	Leveraging Text-to-Image Generation for Handling Spurious Correlation	Aryan Yazdan Parast et.al.	2503.17226	null
2025-03-21	Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation	Giacomo Savazzi et.al.	2503.17224	null
2025-03-21	UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models	Fanghua Yu et.al.	2503.17221	null
2025-03-21	FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy	Xingchao Yang et.al.	2503.17197	null
2025-03-21	TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning	Sheng Wang et.al.	2503.17195	null
2025-03-21	ExplainitAI: When do we trust artificial intelligence? The influence of content and explainability in a cross-cultural comparison	Sora Kang et.al.	2503.17158	null
2025-03-21	D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens	Panpan Wang et.al.	2503.17155	null
2025-03-21	R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model	Boyuan Zheng et.al.	2503.17097	null
2025-03-20	Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation	Yuqing Wang et.al.	2503.16430	null
2025-03-20	SynCity: Training-Free Generation of 3D Worlds	Paul Engstler et.al.	2503.16420	null
2025-03-20	DreamTexture: Shape from Virtual Texture with Analysis by Augmentation	Ananta R. Bhattarai et.al.	2503.16412	null
2025-03-20	VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness	SeungJu Cha et.al.	2503.16406	link
2025-03-20	ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos	Haolin Yang et.al.	2503.16400	null
2025-03-20	Scale-wise Distillation of Diffusion Models	Nikita Starodubcev et.al.	2503.16397	null
2025-03-21	SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation	Chun-Han Yao et.al.	2503.16396	null
2025-03-20	Do Visual Imaginations Improve Vision-and-Language Navigation Agents?	Akhil Perincherry et.al.	2503.16394	null
2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null
2025-03-20	Heat transfer and mixing in initiated Chemical Vapor Deposition analyzed by in-situ gas composition sensing	Simon Shindler et.al.	2503.16373	null
2025-03-20	Ultra-Resolution Adaptation with Ease	Ruonan Yu et.al.	2503.16322	link
2025-03-20	Rapid patient-specific neural networks for intraoperative X-ray to volume registration	Vivek Gopalakrishnan et.al.	2503.16309	link
2025-03-20	Unleashing Vecset Diffusion Model for Fast Shape Generation	Zeqiang Lai et.al.	2503.16302	link
2025-03-20	Diffusion-augmented Graph Contrastive Learning for Collaborative Filter	Fan Huang et.al.	2503.16290	null
2025-03-20	SceneMI: Motion In-betweening for Modeling Human-Scene Interactions	Inwoo Hwang et.al.	2503.16289	null
2025-03-19	FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers	Ruichen Chen et.al.	2503.15465	link
2025-03-19	Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator	Yuanzhi Zhu et.al.	2503.15457	null
2025-03-19	MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space	Lixing Xiao et.al.	2503.15451	null
2025-03-19	LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding	Amirhossein Kazerouni et.al.	2503.15420	null
2025-03-19	Temporal Regularization Makes Your Video Generator Stronger	Harold Haodong Chen et.al.	2503.15417	null
2025-03-19	Visual Persona: Foundation Model for Full-Body Human Customization	Jisu Nam et.al.	2503.15406	null
2025-03-19	HQNN-FSP: A Hybrid Classical-Quantum Neural Network for Regression-Based Financial Stock Market Prediction	Prashant Kumar Choudhary et.al.	2503.15403	null
2025-03-19	Online Matching under KIID: Enhanced Competitive Analysis through Ordinary Differential Equation Systems	Pan Xu et.al.	2503.15399	null
2025-03-19	CCDP: Composition of Conditional Diffusion Policies with Guided Sampling	Amirreza Razmjoo et.al.	2503.15386	null
2025-03-19	Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers	Corentin Vazia et.al.	2503.15383	null
2025-03-19	Real-world validation of a multimodal LLM-powered pipeline for High-Accuracy Clinical Trial Patient Matching leveraging EHR data	Anatole Callies et.al.	2503.15374	link
2025-03-19	SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models	I-Fan Lin et.al.	2503.15351	null
2025-03-19	Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images	Euclid Collaboration et.al.	2503.15321	null
2025-03-19	SENAI: Towards Software Engineering Native Generative Artificial Intelligence	Mootez Saad et.al.	2503.15282	null
2025-03-19	ImputeGAP: A Comprehensive Library for Time Series Imputation	Quentin Nater et.al.	2503.15250	null
2025-03-18	MusicInfuser: Making Video Diffusion Listen and Dance	Susung Hong et.al.	2503.14505	null
2025-03-18	The Power of Context: How Multimodality Improves Image Super-Resolution	Kangfu Mei et.al.	2503.14503	null
2025-03-18	Deeply Supervised Flow-Based Generative Models	Inkyu Shin et.al.	2503.14494	null
2025-03-18	Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control	NVIDIA et.al.	2503.14492	link
2025-03-18	Stable Virtual Camera: Generative View Synthesis with Diffusion Models	Jensen et.al.	2503.14489	null
2025-03-18	DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers	Minglei Shi et.al.	2503.14487	null
2025-03-18	Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset	Yiqun Mei et.al.	2503.14485	null
2025-03-18	ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing	Yulin Pan et.al.	2503.14482	null
2025-03-18	SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model	Yucheng Mao et.al.	2503.14463	null
2025-03-18	The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models	Erminia Calabrese et.al.	2503.14454	null
2025-03-18	Bolt3D: Generating 3D Scenes in Seconds	Stanislaw Szymanowicz et.al.	2503.14445	null
2025-03-18	MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation	Hongyu Zhang et.al.	2503.14428	null
2025-03-18	Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance	Lisha Li et.al.	2503.14402	null
2025-03-18	A Comprehensive Scatter Correction Model for Micro-Focus Dual-Source Imaging Systems: Combining Ambient, Cross, and Forward Scatter	Jianing Sun et.al.	2503.14386	null
2025-03-18	Impossible Videos	Zechen Bai et.al.	2503.14378	null
2025-03-17	Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images	Tianhao Wu et.al.	2503.13439	null
2025-03-17	Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation	Xinyu Lian et.al.	2503.13424	null
2025-03-17	Securing Virtual Reality Experiences: Unveiling and Tackling Cybersickness Attacks with Explainable AI	Ripan Kumar Kundu et.al.	2503.13419	null
2025-03-17	Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning	Mengyao Lyu et.al.	2503.13383	null
2025-03-17	One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation	Daniil Selikhanovych et.al.	2503.13358	null
2025-03-17	A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Technical Design	Otger Ballester et.al.	2503.13349	null
2025-03-17	Artificial Intelligence-Driven Prognostic Classification of COVID-19 Using Chest X-rays: A Deep Learning Approach	Alfred Simbun et.al.	2503.13277	null
2025-03-17	Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors	Katja Schwarz et.al.	2503.13272	null
2025-03-17	Graph Generative Models Evaluation with Masked Autoencoder	Chengen Wang et.al.	2503.13271	null
2025-03-17	FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis	Luxi Chen et.al.	2503.13265	null
2025-03-17	Dense Policy: Bidirectional Autoregressive Learning of Actions	Yue Su et.al.	2503.13217	null
2025-03-17	MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis	Marvin Seyfarth et.al.	2503.13211	null
2025-03-17	Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images	Yaxi Chen et.al.	2503.13131	null
2025-03-17	3D Human Interaction Generation: A Survey	Siyuan Fan et.al.	2503.13120	null
2025-03-17	DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry	Jing Li et.al.	2503.13110	link
2025-03-14	From few to many maps: A fast map-level emulator for extreme augmentation of CMB systematics datasets	P. Campeti et.al.	2503.11643	link
2025-03-14	Gradient-bridged Posterior: Bayesian Inference for Models with Implicit Functions	Cheng Zeng et.al.	2503.11637	null
2025-03-14	Pathology Image Compression with Pre-trained Autoencoders	Srikar Yellapragada et.al.	2503.11591	null
2025-03-14	Dynamics of a coupled nonlocal PDE-ODE system with spatial memory: well-posedness, stability, and bifurcation analysis	Yurij Salmaniw et.al.	2503.11550	null
2025-03-14	AugGen: Synthetic Augmentation Can Improve Discriminative Models	Parsa Rahimi et.al.	2503.11544	null
2025-03-14	Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models	Hao Cheng et.al.	2503.11519	null
2025-03-14	Perfect Stabilization of Biomolecular Adhesions under Load	Anton F. Burnet et.al.	2503.11510	null
2025-03-14	Exponential Quantum Advantage for Simulating Open Classical Systems	Agi Villanyi et.al.	2503.11483	null
2025-03-14	T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation	Seyed Mohammad Hadi Hosseini et.al.	2503.11481	null
2025-03-14	Integrating LLMs in Gamified Systems	Carlos J. Costa et.al.	2503.11458	null
2025-03-14	Extending Ambient Pressure X-ray Photoelectron Spectroscopy to Plasma Studies: A novel and flexible plasma gun approach	Yang Gu et.al.	2503.11446	null
2025-03-14	TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation	Hongxiang Zhao et.al.	2503.11423	null
2025-03-14	MTV-Inpaint: Multi-Task Long Video Inpainting	Shiyuan Yang et.al.	2503.11412	null
2025-03-14	Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models	Jonas Thietke et.al.	2503.11404	null
2025-03-14	BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model	Ziyue Wang et.al.	2503.11372	link
2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link
2025-03-13	Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective	Xiaoming Zhao et.al.	2503.10638	null
2025-03-14	Distilling Diversity and Control in Diffusion Models	Rohit Gandikota et.al.	2503.10637	null
2025-03-13	HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model	Jiaming Liu et.al.	2503.10631	null
2025-03-13	NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models	Mert Albaba et.al.	2503.10626	null
2025-03-13	DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation	Chen Chen et.al.	2503.10618	null
2025-03-13	MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction	Yingshuang Zou et.al.	2503.10604	null
2025-03-13	CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models	Hao He et.al.	2503.10592	null
2025-03-13	Long Context Tuning for Video Generation	Yuwei Guo et.al.	2503.10589	null
2025-03-13	Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures	Nina Vesseron et.al.	2503.10576	null
2025-03-13	MASQUE: A Text-Guided Diffusion-Based Framework for Localized and Customized Adversarial Makeup	Youngjin Kwon et.al.	2503.10549	null
2025-03-13	Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression	Hooman Shahrokhi et.al.	2503.10512	null
2025-03-13	Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion	Evgeniia Vu et.al.	2503.10488	null
2025-03-13	Applying Tabular Deep Learning Models to Estimate Crash Injury Types of Young Motorcyclists	Shriyank Somvanshi et.al.	2503.10474	null
2025-03-13	Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback	Derun Li et.al.	2503.10434	null
2025-03-12	PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop	Chenyu Li et.al.	2503.09595	link
2025-03-12	Minimax Optimality of the Probability Flow ODE for Diffusion Models	Changxiao Cai et.al.	2503.09583	null
2025-03-12	Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models	Marianne Arriola et.al.	2503.09573	link
2025-03-12	TPDiff: Temporal Pyramid Video Diffusion Model	Lingmin Ran et.al.	2503.09566	null
2025-03-12	FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model	Jiahao Xia et.al.	2503.09560	null
2025-03-12	GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals	Shuokang Huang et.al.	2503.09537	null
2025-03-12	Total Ionizing Dose Measurements in Small Satellites in LEO using LabOSat-01	Lucas Finazzi et.al.	2503.09520	null
2025-03-12	CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images	Bin Hu et.al.	2503.09514	null
2025-03-12	DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction	Junjie Zhou et.al.	2503.09491	link
2025-03-12	Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation	Máté Tóth et.al.	2503.09464	null
2025-03-12	How Well Does Your Tabular Generator Learn the Structure of Tabular Data?	Xiangjian Jiang et.al.	2503.09453	link
2025-03-12	Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models	Zhihua Tian et.al.	2503.09446	link
2025-03-12	SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation	Qijian Zhang et.al.	2503.09439	null
2025-03-12	Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space	Yifan Zhou et.al.	2503.09419	link
2025-03-12	Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation	Xiuzhen Guo et.al.	2503.09408	null
2025-03-11	OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models	Jialv Zou et.al.	2503.08686	link
2025-03-11	GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing	Yuanhao Wang et.al.	2503.08678	null
2025-03-12	OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting	Yongsheng Yu et.al.	2503.08677	null
2025-03-11	Language-Depth Navigated Thermal and Visible Image Fusion	Jinchang Zhang et.al.	2503.08676	null
2025-03-11	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673	null
2025-03-11	Modeling Stock Return Distributions and Pricing Options	Xinxin Jiang et.al.	2503.08666	null
2025-03-11	REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder	Yitian Zhang et.al.	2503.08665	null
2025-03-11	MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention	Yuhan Wang et.al.	2503.08664	link
2025-03-11	MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input	Zhenchen Wan et.al.	2503.08650	null
2025-03-11	Rethinking Diffusion Model in High Dimension	Zhenxin Zheng et.al.	2503.08643	link
2025-03-11	Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention	Emily Xiao et.al.	2503.08640	link
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-11	Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling	Subin Kim et.al.	2503.08605	null
2025-03-11	3D Point Cloud Generation via Autoregressive Up-sampling	Ziqiao Meng et.al.	2503.08594	null
2025-03-11	Proc4Gem: Foundation models for physical agency through procedural generation	Yixin Lin et.al.	2503.08593	null
2025-03-10	GenAIReading: Augmenting Human Cognition with Interactive Digital Textbooks Using Large Language Models and Image Generation Models	Ryugo Morita et.al.	2503.07463	null
2025-03-10	Advancing our Understanding of Optoionic Effects for the Design of Solar Batteries: A Theoretical Perspective	Matteo Rinaldi et.al.	2503.07460	null
2025-03-10	Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration	Dylan J. Foster et.al.	2503.07453	null
2025-03-10	DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks	Feiran You et.al.	2503.07433	link
2025-03-10	AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion	Mingzhen Sun et.al.	2503.07418	null
2025-03-10	TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision	Shaobin Zhuang et.al.	2503.07416	null
2025-03-10	SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models	Ouxiang Li et.al.	2503.07392	link
2025-03-10	PersonaBooth: Personalized Text-to-Motion Generation	Boeun Kim et.al.	2503.07390	null
2025-03-10	TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models	Ruidong Chen et.al.	2503.07389	link
2025-03-10	RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing	Yiqing Xie et.al.	2503.07358	link
2025-03-10	AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models	Bo Huang et.al.	2503.07307	link
2025-03-10	Cool-3D: An End-to-End Thermal-Aware Framework for Early-Phase Design Space Exploration of Microfluidic-Cooled 3DICs	Runxi Wang et.al.	2503.07297	link
2025-03-10	Efficient Distillation of Classifier-Free Guidance using Adapters	Cristian Perez Jensen et.al.	2503.07274	null
2025-03-10	Customized SAM 2 for Referring Remote Sensing Image Segmentation	Fu Rong et.al.	2503.07266	null
2025-03-11	AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis	Zhangyu Lai et.al.	2503.07253	null
2025-03-07	AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data	Zengqun Zhao et.al.	2503.05665	link
2025-03-07	TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models	Mark YU et.al.	2503.05638	null
2025-03-07	A functional approach for curve alignment and shape analysis	Issam-Ali Moindjié et.al.	2503.05632	null
2025-03-07	Geometric Optimization of Patterned Conductive Polymer Composite-based Strain Sensors Toward Enhanced Sensing Performance	Jia-Chen Shang et.al.	2503.05603	null
2025-03-07	Diffusion Models for Cayley Graphs	Michael R. Douglas et.al.	2503.05558	null
2025-03-07	Radio Frequency from Optical with Instabilities below $10^{-15}$ - Generation and Measurement	A. Hati et.al.	2503.05547	null
2025-03-10	Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion	Julius Franke et.al.	2503.05539	null
2025-03-07	Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations	Eren Erogullari et.al.	2503.05522	link
2025-03-07	Noise-Robust Radio Frequency Fingerprint Identification Using Denoise Diffusion Model	Guolin Yin et.al.	2503.05514	null
2025-03-07	Localized necking under global compression in two-scale metallic hierarchical solids	Naresh Chockalingam S. et.al.	2503.05498	null
2025-03-07	Umbilical Choir: Automated Live Testing for Edge-To-Cloud FaaS Applications	Mohammadreza Malekabbasi et.al.	2503.05495	link
2025-03-07	Statistical Deficiency for Task Inclusion Estimation	Loïc Fosse et.al.	2503.05491	null
2025-03-07	De Novo Design of Protein-Binding Peptides by Quantum Computing	Lars Meuser et.al.	2503.05458	null
2025-03-07	VLMs Play StarCraft II: A Benchmark and Multimodal Decision Method	Weiyu Ma et.al.	2503.05383	link
2025-03-07	PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?	Martin Spitznagel et.al.	2503.05333	null
2025-03-06	Compositional World Knowledge leads to High Utility Synthetic data	Sachit Gaudi et.al.	2503.04687	null
2025-03-06	What Are You Doing? A Closer Look at Controllable Human Video Generation	Emanuele Bugliarello et.al.	2503.04666	null
2025-03-06	Risk-aware Trading Portfolio Optimization	Marco Bianchetti et.al.	2503.04662	null
2025-03-06	IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval	Tingyu Song et.al.	2503.04644	null
2025-03-06	Simulating the Real World: A Unified Survey of Multimodal Generative Models	Yuqi Hu et.al.	2503.04641	link
2025-03-06	3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs	Artin Saberpour Abadian et.al.	2503.04635	null
2025-03-06	The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation	Aoxiong Yin et.al.	2503.04606	link
2025-03-07	Method for recovering data on unreported low-severity crashes	Alberto Morando et.al.	2503.04529	null
2025-03-06	Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training	Adrian Chang et.al.	2503.04496	null
2025-03-06	InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference	Tianyu Cui et.al.	2503.04483	null
2025-03-06	ToolFuzz -- Automated Agent Tool Testing	Ivan Milev et.al.	2503.04479	null
2025-03-06	Semantic Alignment of Unimodal Medical Text and Vision Representations	Maxime Di Folco et.al.	2503.04478	null
2025-03-06	PALo: Learning Posture-Aware Locomotion for Quadruped Robots	Xiangyu Miao et.al.	2503.04462	null
2025-03-06	Polling on a circle with non-uniform batch arrivals	Tim Engels et.al.	2503.04448	null
2025-03-06	Can Large Language Models Predict Antimicrobial Resistance Gene?	Hyunwoo Yoo et.al.	2503.04413	null
2025-03-05	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link
2025-03-05	DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance	Zhao Yang et.al.	2503.03689	link
2025-03-05	Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Bar Karov et.al.	2503.03669	link
2025-03-05	A Generative Approach to High Fidelity 3D Reconstruction from Text Data	Venkat Kumar R et.al.	2503.03664	null
2025-03-05	DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles	Rui Zhao et.al.	2503.03651	link
2025-03-05	Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias	Rui Lu et.al.	2503.03595	null
2025-03-05	Generative Artificial Intelligence in Robotic Manipulation: A Survey	Kun Zhang et.al.	2503.03464	null
2025-03-05	Predicting Practically? Domain Generalization for Predictive Analytics in Real-world Environments	Hanyu Duan et.al.	2503.03399	link
2025-03-05	Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation	Xiaotong Zhang et.al.	2503.03367	null
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-05	Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters	Julia Hindel et.al.	2503.03299	null
2025-03-05	Group Delay Dispersion Measurements of Novel Multilayer Interference Coatings in the Mid-Infrared Spectral Regime	Ulrich Galander et.al.	2503.03289	null
2025-03-06	Optimizing for the Shortest Path in Denoising Diffusion Model	Ping Chen et.al.	2503.03265	link
2025-03-05	Mean Field Game of Controls with State Reflections: Existence and Limit Theory	Lijun Bo et.al.	2503.03253	null
2025-03-05	GenColor: Generative Color-Concept Association in Visual Design	Yihan Hou et.al.	2503.03236	null
2025-03-04	ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models	Qinyu Zhao et.al.	2503.02883	link
2025-03-04	SeqFusion: Sequential Fusion of Pre-Trained Models for Zero-Shot Time-Series Forecasting	Ting-Ji Huang et.al.	2503.02836	link
2025-03-04	A Multimodal Symphony: Integrating Taste and Sound through Generative AI	Matteo Spanio et.al.	2503.02823	link
2025-03-04	Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts	Marta Skreta et.al.	2503.02819	link
2025-03-04	"What If Smart Homes Could See Our Homes?": Exploring DIY Smart Home Building Experiences with VLM-Based Camera Sensors	Sojeong Yun et.al.	2503.02816	null
2025-03-04	Generating Reliable Initial Velocity Models for Full-waveform Inversion with Well and Structural Constraints	Qingchen Zhang et.al.	2503.02815	null
2025-03-04	Applying Computational Engineering Modelling to Analyse the Social Impact of Conflict and Violent Events	Felix Schwebel et.al.	2503.02771	null
2025-03-04	Revolutionizing Command Interface: Maximizing Control Efficiency in INO ICAL Experiment with UDP Protocol	Yuvaraj Elangovan et.al.	2503.02751	null
2025-03-04	Seeded Poisson Factorization: Leveraging domain knowledge to fit topic models	Bernd Prostmaier et.al.	2503.02741	link
2025-03-04	Variable-Friction In-Hand Manipulation for Arbitrary Objects via Diffusion-Based Imitation Learning	Qiyang Yan et.al.	2503.02738	null
2025-03-04	Zero-Shot Complex Question-Answering on Long Scientific Documents	Wanting Wang et.al.	2503.02695	link
2025-03-04	Generative Modeling of Microweather Wind Velocities for Urban Air Mobility	Tristan A. Shah et.al.	2503.02690	link
2025-03-04	A user-friendly SPARQL query editor powered by lightweight metadata	Vincent Emonet et.al.	2503.02688	link
2025-03-04	Cellular Automaton With CNN	Valery Ashu et.al.	2503.02652	link
2025-03-04	Xavier: Toward Better Coding Assistance in Authoring Tabular Data Wrangling Scripts	Yunfan Zhou et.al.	2503.02639	null
2025-02-28	How far can we go with ImageNet for Text-to-Image generation?	L. Degeorge et.al.	2502.21318	null
2025-02-28	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null
2025-02-28	Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion	Kulin Shah et.al.	2502.21278	null
2025-02-28	Dynamic Markov Blanket Detection for Macroscopic Physics Discovery	Jeff Beck et.al.	2502.21217	link
2025-02-28	AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks	Pedro Gimenes et.al.	2502.21196	null
2025-02-28	Joint Modeling in Recommendations: A Survey	Xiangyu Zhao et.al.	2502.21195	null
2025-02-28	SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training	Fakrul Islam Tushar et.al.	2502.21187	null
2025-02-28	A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images	Zineb Sordo et.al.	2502.21151	null
2025-02-28	Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?	Charles Dawson et.al.	2502.21110	null
2025-02-28	Spatial Reasoning with Denoising Models	Christopher Wewer et.al.	2502.21075	null
2025-02-28	GUIDE: LLM-Driven GUI Generation Decomposition for Automated Prototyping	Kristian Kolthoff et.al.	2502.21068	null
2025-02-28	Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport	Jingru Fu et.al.	2502.21049	link
2025-02-28	Toward interoperable representation and sharing of disinformation incidents in cyber threat intelligence	Felipe Sánchez González et.al.	2502.20997	link
2025-02-28	Generative Uncertainty in Diffusion Models	Metod Jazbec et.al.	2502.20946	null
2025-02-28	DiffBrush: Just Painting the Art by Your Hands	Jiaming Chu et.al.	2502.20904	null
2025-02-27	InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions	Sirui Xu et.al.	2502.20390	link
2025-02-27	Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation	Sucheng Ren et.al.	2502.20388	link
2025-02-27	Tight Inversion: Image-Conditioned Inversion for Real Image Editing	Edo Kadosh et.al.	2502.20376	null
2025-02-27	Constrained Generative Modeling with Manually Bridged Diffusion Models	Saeid Naderiparizi et.al.	2502.20371	null
2025-02-27	ACCORD: Application Context-aware Cross-layer Optimization and Resource Design for 5G/NextG Machine-centric Applications	Azuka Chiejina et.al.	2502.20320	null
2025-02-27	FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction	Siyu Jiao et.al.	2502.20313	link
2025-02-27	Mobius: Text to Seamless Looping Video Generation via Latent Shift	Xiuli Bi et.al.	2502.20307	link
2025-02-27	Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions	Palawat Busaranuvong et.al.	2502.20277	null
2025-02-27	Do computer vision foundation models learn the low-level characteristics of the human visual system?	Yancheng Cai et.al.	2502.20256	null
2025-02-28	Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets	Chi-Chien Tsai et.al.	2502.20246	null
2025-02-27	From Retrieval to Generation: Comparing Different Approaches	Abdelrahman Abdallah et.al.	2502.20245	null
2025-02-27	Attention Distillation: A Unified Approach to Visual Characteristics Transfer	Yang Zhou et.al.	2502.20235	link
2025-02-27	AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions	Clare Grogan et.al.	2502.20231	link
2025-02-27	Model Checking Linear Temporal Logic with Standpoint Modalities	Rajab Aghamov et.al.	2502.20193	null
2025-02-27	Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Liang Chen et.al.	2502.20172	link
2025-02-26	Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis	Minjoo Lim et.al.	2502.19390	null
2025-02-26	Deep Learning For Time Series Analysis With Application On Human Motion	Ali Ismail-Fawaz et.al.	2502.19364	null
2025-02-26	Shh, don't say that! Domain Certification in LLMs	Cornelius Emde et.al.	2502.19320	null
2025-02-26	AI-Powered Bayesian Inference	Veronika Ročková et.al.	2502.19231	null
2025-02-26	HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection	Zekang Weng et.al.	2502.19200	null
2025-02-27	INFO-SEDD: Continuous Time Markov Chains as Scalable Information Metrics Estimators	Alberto Foresti et.al.	2502.19183	null
2025-02-26	A Model-Centric Review of Deep Learning for Protein Design	Gregory W. Kyro et.al.	2502.19173	null
2025-02-27	RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images	Yuhan Tang et.al.	2502.19153	null
2025-02-26	Identification Under the Semantic Effective Secrecy Constraint	Abdalla Ibrahim et.al.	2502.19142	null
2025-02-26	Improving customer service with automatic topic detection in user emails	Bojana Bašaragin et.al.	2502.19115	null
2025-02-26	Modulation of the galactic cosmic ray spectrum in an anisotropic diffusion approach	V. D. Borisov et.al.	2502.19062	null
2025-02-26	A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models	Vu Tuan Truong Long et.al.	2502.19047	null
2025-02-26	OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment	Jiaxin Deng et.al.	2502.18965	null
2025-02-26	DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model	Lei Zhao et.al.	2502.18952	null
2025-02-26	A Novel Topology Recovery Method for Low Voltage Distribution Networks	Sina Mohammadi et.al.	2502.18939	null
2025-02-25	K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs	Ziheng Ouyang et.al.	2502.18461	null
2025-02-25	ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies	Pedro Sequeira et.al.	2502.18438	null
2025-02-25	Sparse Bayesian Generative Modeling for Joint Parameter and Channel Estimation	Benedikt Böck et.al.	2502.18369	null
2025-02-25	ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation	Yifan Pu et.al.	2502.18364	null
2025-02-25	Stretchable Capacitive and Resistive Strain Sensors: Accessible Manufacturing Using Direct Ink Writing	Lukas Cha et.al.	2502.18363	null
2025-02-25	Towards softerware: Enabling personalization of interactive data representations for users with disabilities	Frank Elavsky et.al.	2502.18348	link
2025-02-25	LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation	Pengzhi Li et.al.	2502.18302	null
2025-02-26	Bayesian Computation in Deep Learning	Wenlong Chen et.al.	2502.18300	null
2025-02-26	Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support	Guoxin Wang et.al.	2502.18274	link
2025-02-25	Imperfect Knowledge Management (IKM) in GEFRED (GENeralized model for Fuzzy RElational Databases)	Leoncio Jimenez et.al.	2502.18255	null
2025-02-25	A 3D Printed Quad-Ridged Flared Horn Antenna Feeder for Radio-Telescopes	Andreas Hofmann et.al.	2502.18243	null
2025-02-25	Causal AI-based Root Cause Identification: Research to Practice at Scale	Saurabh Jha et.al.	2502.18240	null
2025-02-25	Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints	Mihaela Cătălina Stoian et.al.	2502.18237	link
2025-02-25	Principled priors for Bayesian inference of circular models	Xiang Ye et.al.	2502.18223	null
2025-02-25	UASTrack: A Unified Adaptive Selection Framework with Modality-Customization in Single Object Tracking	He Wang et.al.	2502.18220	null
2025-02-24	Fractal Generative Models	Tianhong Li et.al.	2502.17437	link
2025-02-24	GCC: Generative Color Constancy via Diffusing a Color Checker	Chen-Wei Chang et.al.	2502.17435	null
2025-02-24	S4S: Solving for a Diffusion Model Solver	Eric Frankel et.al.	2502.17423	null
2025-02-24	X-Dancer: Expressive Music to Human Dance Video Generation	Zeyuan Chen et.al.	2502.17414	null
2025-02-24	What is a Good Question? Utility Estimation with LLM-based Simulations	Dong-Ho Lee et.al.	2502.17383	null
2025-02-25	KV-Edit: Training-Free Image Editing for Precise Background Preservation	Tianrui Zhu et.al.	2502.17363	link
2025-02-24	RELICT: A Replica Detection Framework for Medical Image Generation	Orhun Utku Aydin et.al.	2502.17360	link
2025-02-24	How Scientists Use Large Language Models to Program	Gabrielle O'Brien et.al.	2502.17348	null
2025-02-24	AnyTop: Character Animation Diffusion with Any Topology	Inbar Gat et.al.	2502.17327	link
2025-02-24	Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents	Prafulla Kumar Choubey et.al.	2502.17321	null
2025-02-24	Robust Federated Learning in Unreliable Wireless Networks: A Client Selection Approach	Yanmeng Wang et.al.	2502.17260	null
2025-02-24	VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing	Xiangpeng Yang et.al.	2502.17258	null
2025-02-24	Learning Image Fractals Using Chaotic Differentiable Point Splatting	Adarsh Djeacoumar et.al.	2502.17230	null
2025-02-24	Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation	Baptiste Chopin et.al.	2502.17198	null
2025-02-24	Unsupervised Accelerated MRI Reconstruction via Ground-Truth-Free Flow Matching	Xinzhe Luo et.al.	2502.17174	null
2025-02-21	One-step Diffusion Models with $f$-Divergence Distribution Matching	Yilun Xu et.al.	2502.15681	null
2025-02-21	VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Florent Bartoccioni et.al.	2502.15672	link
2025-02-21	Overview of the data acquisition system architecture for the DarkSide-20k experiment	Maria Adriana Sabia et.al.	2502.15651	null
2025-02-21	WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents	Xinhang Liu et.al.	2502.15601	null
2025-02-21	Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid	Yunfeng Li et.al.	2502.15583	null
2025-02-21	Enhancing RWKV-based Language Models for Long-Sequence Text Generation	Xinghan Pan et.al.	2502.15485	link
2025-02-21	Development and Performance Validation of a Versatile VLBI Digital Backend Using the ROACH2 Platform	Jiyun Li et.al.	2502.15446	null
2025-02-21	Modeling Infectious Diseases: From SIR Models to Diffusion-Based Approaches and Numerical Solutions	Ayesha Baig et.al.	2502.15439	null
2025-02-21	Efficiently Solving Discounted MDPs with Predictions on Transition Matrices	Lixing Lyu et.al.	2502.15345	null
2025-02-21	Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models	Jianming Chang et.al.	2502.15292	null
2025-02-21	BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization	Tonghan Wang et.al.	2502.15283	null
2025-02-21	CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models	Shunchang Liu et.al.	2502.15278	null
2025-02-21	On the (In)Security of Non-resettable Device Identifiers in Custom Android Systems	Zikan Dong et.al.	2502.15270	null
2025-02-21	User Experience with LLM-powered Conversational Recommendation Systems: A Case of Music Recommendation	Sojeong Yun et.al.	2502.15229	null
2025-02-21	Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis	Yifan Jiang et.al.	2502.15204	link
2025-02-20	Improving the Diffusability of Autoencoders	Ivan Skorokhodov et.al.	2502.14831	null
2025-02-20	A Survey on Text-Driven 360-Degree Panorama Generation	Hai Wang et.al.	2502.14799	null
2025-02-20	Real-Time Device Reach Forecasting Using HLL and MinHash Data Sketches	Chandrashekar Muniyappa et.al.	2502.14785	null
2025-02-20	DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models	Hongji Yang et.al.	2502.14779	null
2025-02-20	Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes	Lukas Rauch et.al.	2502.14721	null
2025-02-20	ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation	Angxiao Yue et.al.	2502.14637	link
2025-02-20	A Theory for Conditional Generative Modeling on Multiple Data Sources	Rongzhen Wang et.al.	2502.14583	link
2025-02-20	Multiscale Byte Language Models -- A Hierarchical Architecture for Causal Million-Length Sequence Modeling	Eric Egli et.al.	2502.14553	link
2025-02-20	Dynamic Preference-based Multi-modal Trip Planning of Public Transport and Shared Mobility	Yimeng Zhang et.al.	2502.14528	null
2025-02-20	How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?	Sergey Pletenev et.al.	2502.14502	link
2025-02-20	StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following	Jinnan Li et.al.	2502.14494	link
2025-02-20	How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation	Zhuohang Long et.al.	2502.14486	null
2025-02-20	Algorithms for min-buying in networks	Aaditya Bhardwaj et.al.	2502.14459	null
2025-02-20	PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data	Shijie Huang et.al.	2502.14397	link
2025-02-20	Enhancing Portuguese Variety Identification with Cross-Domain Approaches	Hugo Sousa et.al.	2502.14394	null
2025-02-19	IP-Composer: Semantic Composition of Visual Concepts	Sara Dorfman et.al.	2502.13951	null
2025-02-19	Image compositing is all you need for data augmentation	Ang Jia Ning Shermaine et.al.	2502.13936	null
2025-02-19	TESS 2: A Large-Scale Generalist Diffusion Language Model	Jaesung Tae et.al.	2502.13917	link
2025-02-19	DataSciBench: An LLM Agent Benchmark for Data Science	Dan Zhang et.al.	2502.13897	link
2025-02-19	Performance Comparison of Graph Representations Which Support Dynamic Graph Updates	Subhajit Sahu et.al.	2502.13862	link
2025-02-19	Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions	Xinwei Shen et.al.	2502.13747	null
2025-02-19	Deep Learning for VWAP Execution in Crypto Markets: Beyond the Volume Curve	Remi Genet et.al.	2502.13722	link
2025-02-19	Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation	Peiwen Yuan et.al.	2502.13576	null
2025-02-19	ETS: Efficient Tree Search for Inference-Time Scaling	Coleman Hooper et.al.	2502.13575	link
2025-02-19	RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior	Ching-Hua Lee et.al.	2502.13574	null
2025-02-19	Diffusion Model Agnostic Social Influence Maximization in Hyperbolic Space	Hongliang Qiao et.al.	2502.13571	null
2025-02-19	Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs	Joonatan Laato et.al.	2502.13566	null
2025-02-19	Controlling deposition and characterising dynamics of thin liquid films with high temporal and spatial resolution	G Le Lay et.al.	2502.13552	null
2025-02-19	VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation	Wei Zhao et.al.	2502.13508	link
2025-02-19	Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models	Chenyu Zhu et.al.	2502.13474	null
2025-02-18	AV-Flow: Transforming Text to Audio-Visual Human-like Interactions	Aggelina Chatziagapi et.al.	2502.13133	null
2025-02-18	Is Noise Conditioning Necessary for Denoising Generative Models?	Qiao Sun et.al.	2502.13129	null
2025-02-18	HARP: A Taxonomy for Heterogeneous and Hierarchical Processors for Mixed-reuse Workloads	Raveesh Garg et.al.	2502.13113	null
2025-02-18	Score Matching Riemannian Diffusion Means	Frederik Möbius Rygaard et.al.	2502.13106	null
2025-02-18	tn4ml: Tensor Network Training and Customization for Machine Learning	Ema Puljak et.al.	2502.13090	link
2025-02-18	A Neural Difference-of-Entropies Estimator for Mutual Information	Haoran Ni et.al.	2502.13085	null
2025-02-18	Personalized Image Generation with Deep Generative Models: A Decade Survey	Yuxiang Wei et.al.	2502.13081	link
2025-02-18	Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs	Longxu Dou et.al.	2502.12982	null
2025-02-18	Towards Variational Flow Matching on General Geometries	Olga Zaghen et.al.	2502.12981	null
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	CooLBM: A Collaborative Open-Source Reactive Multi-Phase/Component Simulation Code via Lattice Boltzmann Method	R. Alamian et.al.	2502.12955	null
2025-02-18	Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression	Jaemoon Lee et.al.	2502.12951	null
2025-02-18	A Simplified and Numerically Stable Approach to the BG/NBD Churn Prediction model	Dylan Zammit et.al.	2502.12912	null
2025-02-18	Probabilistic neural operators for functional uncertainty quantification	Christopher Bülte et.al.	2502.12902	link
2025-02-18	CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image	Kaixin Yao et.al.	2502.12894	null
2025-02-17	Diffusion Models without Classifier-free Guidance	Zhicong Tang et.al.	2502.12154	link
2025-02-17	Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening	Ye Tian et.al.	2502.12146	link
2025-02-17	Correlative X-ray and electron tomography for scale-bridging, quantitative analysis of complex, hierarchical particle systems	Alexander Götz et.al.	2502.12140	null
2025-02-17	LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities	Florian Sestak et.al.	2502.12128	link
2025-02-17	Discriminative-Generative Custom Tokens for Vision-Language Models	Pramuditha Perera et.al.	2502.12095	null
2025-02-17	How compositional generalization and creativity improve as diffusion models are trained	Alessandro Favero et.al.	2502.12089	null
2025-02-17	AdaSplash: Adaptive Sparse Flash Attention	Nuno Gonçalves et.al.	2502.12082	link
2025-02-17	HumanGif: Single-View Human Diffusion with Generative Prior	Shoukang Hu et.al.	2502.12080	link
2025-02-17	A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond	Shreya Shukla et.al.	2502.12048	null
2025-02-17	Unsupervised Structural-Counterfactual Generation under Domain Shift	Krishn Vishwas Kher et.al.	2502.12013	null
2025-02-17	Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images	Negar Kamali et.al.	2502.11989	link
2025-02-17	Design Considerations Based on Stability for a Class of TCP Algorithms	Sreekanth Prabhakar et.al.	2502.11983	null
2025-02-17	Image Inversion: A Survey from GANs to Diffusion and Beyond	Yinan Chen et.al.	2502.11974	link
2025-02-17	Generating Text from Uniform Meaning Representation	Emma Markle et.al.	2502.11973	link
2025-02-17	Massively Scaling Explicit Policy-conditioned Value Functions	Nico Bohlinger et.al.	2502.11949	null
2025-02-14	Region-Adaptive Sampling for Diffusion Transformers	Ziming Liu et.al.	2502.10389	null
2025-02-14	ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences	Liyuan Zhu et.al.	2502.10377	null
2025-02-14	AffinityFlow: Guided Flows for Antibody Affinity Maturation	Can Chen et.al.	2502.10365	null
2025-02-14	Dimension-free Score Matching and Time Bootstrapping for Diffusion Models	Syamantak Kumar et.al.	2502.10354	null
2025-02-14	DiOpt: Self-supervised Diffusion for Constrained Optimization	Shutong Ding et.al.	2502.10330	null
2025-02-14	Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions	Leo Zhang et.al.	2502.10328	null
2025-02-14	Analysis and Prediction of Coverage and Channel Rank for UAV Networks in Rural Scenarios with Foliage	Donggu Lee et.al.	2502.10324	null
2025-02-14	Probabilistic Super-Resolution for High-Fidelity Physical System Simulations with Uncertainty Quantification	Pengyu Zhang et.al.	2502.10280	null
2025-02-14	Dark Matter Attenuation Effects: Sensitivity Ceilings for Spin-Dependent and Spin-Independent Interactions	QUEST-DMC Collaboration et.al.	2502.10251	null
2025-02-14	Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control	Thomas Jiralerspong et.al.	2502.10236	null
2025-02-14	Integrated Multi-Simulation Environments for Aerial Robotics Research	Pascal Goldschmid et.al.	2502.10218	link
2025-02-14	VideoDiff: Human-AI Video Co-Creation with Alternatives	Mina Huh et.al.	2502.10190	null
2025-02-14	Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model	Bo Ni et.al.	2502.10173	null
2025-02-14	Modeling biases in binary decision-making within the generalized nonlinear q-voter model	Maciej Doniec et.al.	2502.10172	link
2025-02-14	Modeling and Simulating Emerging Memory Technologies: A Tutorial	Yun-Chih Chen et.al.	2502.10167	null
2025-02-13	Theoretical Benefit and Limitation of Diffusion Language Model	Guhao Feng et.al.	2502.09622	null
2025-02-13	RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets	Isabella Liu et.al.	2502.09615	null
2025-02-13	Designing a Conditional Prior Distribution for Flow-Based Generative Models	Noam Issachar et.al.	2502.09611	null
2025-02-14	Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions	Tejas Jayashankar et.al.	2502.09609	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Memorization and Generalization in Generative Diffusion under the Manifold Hypothesis	Beatrice Achilli et.al.	2502.09578	null
2025-02-13	Wireless and passive pressure detection using magneto-mechanical resonances in process engineering	Timo Merbach et.al.	2502.09575	null
2025-02-13	DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra	Montgomery Bohde et.al.	2502.09571	link
2025-02-13	Diffusing DeBias: a Recipe for Turning a Bug into a Feature	Massimiliano Ciranni et.al.	2502.09564	null
2025-02-13	Cryogenic SiPMs for the Optical Readout of DarkSide-20k	Giuseppe Matteucci et.al.	2502.09558	null
2025-02-13	Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model	Fei Shen et.al.	2502.09533	null
2025-02-13	SQ-GAN: Semantic Image Communications Using Masked Vector Quantization	Francesco Pezone et.al.	2502.09520	link
2025-02-13	Diffusion Models for Molecules: A Survey of Methods and Tasks	Liang Wang et.al.	2502.09511	link
2025-02-14	EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling	Theodoros Kouzelis et.al.	2502.09509	null
2025-02-13	AttentionSmithy: A Modular Framework for Rapid Transformer Development and Customization	Caleb Cranney et.al.	2502.09503	null
2025-02-12	SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation	Ellie Arar et.al.	2502.08642	null
2025-02-12	CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation	Qinghe Wang et.al.	2502.08639	null
2025-02-12	Learning Selection Cuts With Gradients	Mike Hance et.al.	2502.08615	null
2025-02-12	An Initial Condition-Dependent Neural Network Approach for Optimal Control Problems	Mominul Rubel et.al.	2502.08607	null
2025-02-12	Chasing Charge Carriers: Diffusion Dynamics in Mixed-n Quasi-Two-Dimensional Colloidal MAPbBr3 Perovskites	Ronja Maria Piehler et.al.	2502.08601	null
2025-02-12	Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio	Khaled Kahouli et.al.	2502.08598	link
2025-02-12	Light-A-Video: Training-free Video Relighting via Progressive Light Fusion	Yujie Zhou et.al.	2502.08590	link
2025-02-12	Ultrasound Image Generation using Latent Diffusion Models	Benoit Freiche et.al.	2502.08580	null
2025-02-12	Mapping the Landscape of Generative AI in Network Monitoring and Management	Giampaolo Bovenzi et.al.	2502.08576	null
2025-02-12	Statistically validated projection of bipartite signed networks	Anna Gallo et.al.	2502.08567	null
2025-02-12	Human-Centric Foundation Models: Perception, Generation and Agentic Modeling	Shixiang Tang et.al.	2502.08556	link
2025-02-12	BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation	Ao Liu et.al.	2502.08528	null
2025-02-12	FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices	Dezhong Yao et.al.	2502.08518	link
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-12	Computed fingertip touch for the instrumental control of musical sound with an excursion on the computed retinal afterimage	Staas de Jong et.al.	2502.08471	null
2025-02-11	Pippo: High-Resolution Multi-View Humans from a Single Image	Yash Kant et.al.	2502.07785	null
2025-02-11	MatSwap: Light-aware material transfers in images	Ivan Lopes et.al.	2502.07784	null
2025-02-11	Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection	Anirudh Sundara Rajan et.al.	2502.07778	null
2025-02-11	The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing	Dirk Bergemann et.al.	2502.07736	null
2025-02-11	Revisiting Non-Acyclic GFlowNets in Discrete Environments	Nikita Morozov et.al.	2502.07735	link
2025-02-11	DOGlove: Dexterous Manipulation with a Low-Cost Open-Source Haptic Force Feedback Glove	Han Zhang et.al.	2502.07730	null
2025-02-11	Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning	Aya Kayal et.al.	2502.07715	null
2025-02-11	Magic 1-For-1: Generating One Minute Video Clips within One Minute	Hongwei Yi et.al.	2502.07701	link
2025-02-11	Steering Protein Family Design through Profile Bayesian Flow	Jingjing Gong et.al.	2502.07671	null
2025-02-11	Guiding Time-Varying Generative Models with Natural Gradients on Exponential Family Manifold	Song Liu et.al.	2502.07650	null
2025-02-11	Distributional Instrumental Variable Method	Anastasiia Holovchak et.al.	2502.07641	link
2025-02-11	Consistency Training with Physical Constraints	Che-Chia Chang et.al.	2502.07636	null
2025-02-11	Tractable Transformers for Flexible Conditional Generation	Anji Liu et.al.	2502.07616	null
2025-02-11	YOLO Network For Defect Detection In Optical lenses	Habib Yaseen et.al.	2502.07592	null
2025-02-11	Generative Modeling with Bayesian Sample Inference	Marten Lienen et.al.	2502.07580	link
2025-02-10	Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT	Dongyang Liu et.al.	2502.06782	null
2025-02-10	Learning an Optimal Assortment Policy under Observational Data	Yuxuan Han et.al.	2502.06777	null
2025-02-10	Enhancing Performance of Explainable AI Models with Constrained Concept Refinement	Geyu Liang et.al.	2502.06775	null
2025-02-10	Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Jaeyeon Kim et.al.	2502.06768	null
2025-02-10	History-Guided Video Diffusion	Kiwhan Song et.al.	2502.06764	null
2025-02-10	Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists	Bojia Zi et.al.	2502.06734	null
2025-02-10	RSAttAE: An Information-Aware Attention-based Autoencoder Recommender System	Amirhossein Dadashzadeh Taromi et.al.	2502.06705	null
2025-02-10	No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers	Jiajun He et.al.	2502.06685	null
2025-02-10	Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene	Tai-Yu Pan et.al.	2502.06682	null
2025-02-10	Filling a gap in materials mechanics: Nanoindentation at high constant strain rates upto $10^5 s^{-1}$	Lalith Kumar Bhaskar et.al.	2502.06668	null
2025-02-11	Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification	Jiachen Li et.al.	2502.06619	link
2025-02-10	MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models	Kamil Garifullin et.al.	2502.06606	null
2025-02-10	Joint parameter and state estimation for regularized time-discrete multibody dynamics	Hannes Marklund et.al.	2502.06599	null
2025-02-10	A Large-scale AI-generated Image Inpainting Benchmark	Paschalis Giakoumoglou et.al.	2502.06593	null
2025-02-10	Optimizing Energy Efficiency in Subthreshold RISC-V Cores	Asbjørn Djupdal et.al.	2502.06588	null
2025-02-07	FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation	Shilong Zhang et.al.	2502.05179	link
2025-02-07	Fillerbuster: Multi-View Scene Completion for Casual Captures	Ethan Weber et.al.	2502.05175	null
2025-02-07	Multitwine: Multi-Object Compositing with Text and Layout Control	Gemma Canet Tarrés et.al.	2502.05165	null
2025-02-07	Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment	Minh-Quan Le et.al.	2502.05153	null
2025-02-07	Latent Swap Joint Diffusion for Long-Form Audio Generation	Yusheng Dai et.al.	2502.05130	null
2025-02-07	Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images	Aditya Kumar et.al.	2502.05066	link
2025-02-07	Prospects for detecting generic fast-time features in the neutrino lightcurve of nearby supernovae in neutrino telescopes	Jakob Beise et.al.	2502.05024	null
2025-02-07	Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning	Tristan K. Schuler et.al.	2502.05014	null
2025-02-07	Robust Graph Learning Against Adversarial Evasion Attacks via Prior-Free Diffusion-Based Structure Purification	Jiayi Luo et.al.	2502.05000	link
2025-02-07	C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features	Chenxing Sun et.al.	2502.04991	null
2025-02-07	FF7: A Code Package for High-throughput Calculations and Constructing Materials Database	Tiancheng Ma et.al.	2502.04984	null
2025-02-07	Generative-enhanced optimization for knapsack problems: an industry-relevant study	Yelyzaveta Vodovozova et.al.	2502.04928	null
2025-02-07	ARTInp: CBCT-to-CT Image Inpainting and Image Translation in Radiotherapy	Ricardo Coimbra Brioso et.al.	2502.04898	null
2025-02-07	Goku: Flow Based Video Generative Foundation Models	Shoufa Chen et.al.	2502.04896	null
2025-02-07	Training-free Task-oriented Grasp Generation	Jiaming Wang et.al.	2502.04873	null
2025-02-06	Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness	Karolina Rudnicka et.al.	2502.04324	null
2025-02-06	HOG-Diff: Higher-Order Guided Diffusion for Graph Generation	Yiming Huang et.al.	2502.04308	link
2025-02-06	MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Jinbo Xing et.al.	2502.04299	null
2025-02-06	Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression	Lirui Wang et.al.	2502.04296	null
2025-02-06	Breaking the Vault: A Case Study of the 2022 LastPass Data Breach	Jessica Gentles et.al.	2502.04287	null
2025-02-06	Non-Variational Quantum Random Access Optimization with Alternating Operator Ansatz	Zichang He et.al.	2502.04277	null
2025-02-06	Digital Gatekeeping: An Audit of Search Engine Results shows tailoring of queries on the Israel-Palestine Conflict	Íris Damião et.al.	2502.04266	null
2025-02-06	Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention	Ayush K. Varshney et.al.	2502.04260	null
2025-02-06	TriNER: A Series of Named Entity Recognition Models For Hindi, Bengali & Marathi	Mohammed Amaan Dhamaskar et.al.	2502.04245	null
2025-02-06	NLP-Based .NET CLR Event Logs Analyzer	Maxim Stavtsev et.al.	2502.04219	link
2025-02-06	MRAMG-Bench: A BeyondText Benchmark for Multimodal Retrieval-Augmented Multimodal Generation	Qinhan Yu et.al.	2502.04176	link
2025-02-06	Diffusion-based mass map reconstruction from weak lensing data	Supranta S. Boruah et.al.	2502.04158	null
2025-02-06	Synthetic Datasets for Machine Learning on Spatio-Temporal Graphs using PDEs	Jost Arndt et.al.	2502.04140	link
2025-02-06	Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis	Zhen Ye et.al.	2502.04128	link
2025-02-06	Generative Adversarial Networks Bridging Art and Machine Intelligence	Junhao Song et.al.	2502.04116	null
2025-02-05	Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics	Xuan Li et.al.	2502.03449	null
2025-02-05	Masked Autoencoders Are Effective Tokenizers for Diffusion Models	Hao Chen et.al.	2502.03444	null
2025-02-05	Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization	Yu-Han Wu et.al.	2502.03435	null
2025-02-05	A Temporal Convolutional Network-Based Approach and a Benchmark Dataset for Colonoscopy Video Temporal Segmentation	Carlo Biffi et.al.	2502.03430	null
2025-02-05	TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer	Zhihong Xu et.al.	2502.03426	null
2025-02-05	Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation	Alexey A. Novikov et.al.	2502.03420	null
2025-02-05	A Mixture-Based Framework for Guiding Diffusion Models	Yazid Janati et.al.	2502.03332	null
2025-02-05	An efficient end-to-end computational framework for the generation of ECG calibrated volumetric models of human atrial electrophysiology	Elena Zappon et.al.	2502.03322	null
2025-02-05	Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques	Sangjun Han et.al.	2502.03321	null
2025-02-05	Electronic properties and transport in metal/2D material/metal vertical junctions	Gaëlle Bigeard et.al.	2502.03318	null
2025-02-05	Posterior SBC: Simulation-Based Calibration Checking Conditional on Data	Teemu Säilynoja et.al.	2502.03279	link
2025-02-05	General Time-series Model for Universal Knowledge Representation of Multivariate Time-Series data	Cheng He et.al.	2502.03264	null
2025-02-05	Practical Introduction to FEM with GMSH: A MATLAB/Octave Perspective	Victor Dominguez et.al.	2502.03248	null
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-05	Low-cost analog signal chain for transmit-receive circuits of passive induction-based resonators	Fabian Mohn et.al.	2502.03202	null
2025-02-04	COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation	Xueqing Deng et.al.	2502.02589	null
2025-02-04	Calibrated Multi-Preference Optimization for Aligning Diffusion Models	Kyungmin Lee et.al.	2502.02588	null
2025-02-04	Open Materials Generation with Stochastic Interpolants	Philipp Hoellmer et.al.	2502.02582	null
2025-02-04	A Family-Based Approach to Safety Cases for Controlled Airspaces in Small Uncrewed Aerial Systems	Michael C. Hunter et.al.	2502.02559	null
2025-02-04	Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation	Jian Liu et.al.	2502.02525	link
2025-02-04	Privacy Attacks on Image AutoRegressive Models	Antoni Kowalczuk et.al.	2502.02514	link
2025-02-04	Generative Modeling on Lie Groups via Euclidean Generalized Score Matching	Marco Bertolini et.al.	2502.02513	null
2025-02-04	Learning to generate physical ocean states: Towards hybrid climate modeling	Etienne Meunier et.al.	2502.02499	null
2025-02-04	Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions?	Xiyuan Wang et.al.	2502.02488	null
2025-02-04	Distributional Diffusion Models with Scoring Rules	Valentin De Bortoli et.al.	2502.02483	null
2025-02-04	Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography	Emir Ahmed et.al.	2502.02475	null
2025-02-04	Towards Consistent and Controllable Image Synthesis for Face Editing	Mengting Wei et.al.	2502.02465	null
2025-02-04	Personalization Toolkit: Training Free Personalization of Large Vision Language Models	Soroush Seifi et.al.	2502.02452	null
2025-02-04	Sparse Data Generation Using Diffusion Models	Phil Ostheimer et.al.	2502.02448	null
2025-02-04	TransformDAS: Mapping Φ-OTDR Signals to Riemannian Manifold for Robust Classification	Jiaju Kang et.al.	2502.02428	null
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-31	Creative Problem-Solving: A Study with Blind and Low Vision Software Professionals	Karina Kohl et.al.	2501.19380	null
2025-01-31	Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions	Sören Christensen et.al.	2501.19373	null
2025-01-31	Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters	Adrián Juan-Delgado et.al.	2501.19356	null
2025-01-31	Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023	Ting-Yao E. Hsu et.al.	2501.19353	null
2025-01-31	Low-cost Microfluidic Testbed for Molecular Communications with Integrated Hydrodynamic Gating and Screen-printed Sensors	Maide Miray Albay et.al.	2501.19341	null
2025-01-31	Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates	Misha P. T Kaandorp et.al.	2501.19338	null
2025-01-31	Analysis of LLMs vs Human Experts in Requirements Engineering	Cory Hymel et.al.	2501.19297	null
2025-01-31	Medical Semantic Segmentation with Diffusion Pretrain	David Li et.al.	2501.19265	null
2025-01-31	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-31	Single cell resolution 3D imaging and segmentation within intact live tissues	G. Paci et.al.	2501.19203	link
2025-01-31	A Variational Perspective on Generative Protein Fitness Optimization	Lea Bogensperger et.al.	2501.19200	null
2025-01-31	PSyDUCK: Training-Free Steganography for Latent Diffusion	Georgia Channing et.al.	2501.19172	null
2025-01-31	RMDM: Radio Map Diffusion Model with Physics Informed	Haozhe Jia et.al.	2501.19160	link
2025-01-31	A theoretical framework for overfitting in energy-based modeling	Giovanni Catania et.al.	2501.19158	null
2025-01-30	Diffusion Autoencoders are Scalable Image Tokenizers	Yinbo Chen et.al.	2501.18593	null
2025-01-30	DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models	Ruofan Liang et.al.	2501.18590	null
2025-01-30	WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training	Benjamin Feuer et.al.	2501.18511	link
2025-01-30	Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline	Shivani Kapania et.al.	2501.18493	null
2025-01-30	CodeBrain: Impute Any Brain MRI via Instance-specific Scalar-quantized Codes	Yicheng Wu et.al.	2501.18328	null
2025-01-30	How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Vilém Zouhar et.al.	2501.18251	link
2025-01-30	Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss	Wenshuo Chen et.al.	2501.18232	link
2025-01-30	Inverse source problem of sub-diffusion of variable exponent	Zhiyuan Li et.al.	2501.18228	null
2025-01-30	Behavior Modeling Space Reconstruction for E-Commerce Search	Yejing Wang et.al.	2501.18216	null
2025-01-30	Joint Design and Pricing of Extended Warranties for Multiple Automobiles with Different Price Bands	Yajing Chen et.al.	2501.18203	null
2025-01-30	Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization	Kevin Cooper et.al.	2501.18174	null
2025-01-31	RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing	Jinyao Guo et.al.	2501.18160	null
2025-01-30	The Dilemma of Building Do-It-Yourself (DIY) Solutions for Workplace Accessibility	Yoonha Cha et.al.	2501.18148	null
2025-01-30	HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback	Xufeng Cai et.al.	2501.18126	null
2025-01-29	SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders	Bartosz Cywiński et.al.	2501.18052	link
2025-01-29	Enriched Immersed Finite Element and Isogeometric Analysis -- Algorithms and Data Structures	Nils Wunsch et.al.	2501.17853	null
2025-01-29	acoupi: An Open-Source Python Framework for Deploying Bioacoustic AI Models on Edge Devices	Aude Vuilliomenet et.al.	2501.17841	link
2025-01-29	Atomic Transfer Graphs: Secure-by-design Protocols for Heterogeneous Blockchain Ecosystems	Stephan Dübler et.al.	2501.17786	null
2025-01-29	Generative Unordered Flow for Set-Structured Data Generation	Yangming Li et.al.	2501.17770	null
2025-01-29	Formally Verified Binary-level Pointer Analysis	Freek Verbeek et.al.	2501.17766	null
2025-01-29	In-IDE Programming Courses: Learning Software Development in a Real-World Setting	Anastasiia Birillo et.al.	2501.17747	null
2025-01-29	Testing Research Software: An In-Depth Survey of Practices, Methods, and Tools	Nasir U. Eisty et.al.	2501.17739	null
2025-01-29	A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches	Ana R. Baião et.al.	2501.17729	null
2025-01-29	VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback	Sayeh Gholipour Picha et.al.	2501.17726	null
2025-01-29	Source-Channel Separation Theorems for Distortion Perception Coding	Chao Tian et.al.	2501.17706	null
2025-01-29	Distinguished Quantized Guidance for Diffusion-based Sequence Recommendation	Wenyu Mao et.al.	2501.17670	null
2025-01-29	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-29	Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis	Kunrong Li et.al.	2501.17598	null
2025-01-29	Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding	Marco Pasini et.al.	2501.17578	null
2025-01-29	Exploring the Potential of Wireless-enabled Multi-Chip AI Accelerators	Emmanuel Irabor et.al.	2501.17567	null
2025-01-28	CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation	Nikolai Kalischek et.al.	2501.17162	null
2025-01-28	IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait	Han Yang et.al.	2501.17159	null
2025-01-28	First Axion-Like Particle Results from a Broadband Search for Wave-Like Dark Matter in the 44 to 52 $\mu$eV Range with a Coaxial Dish Antenna	Gabe Hoshino et.al.	2501.17119	null
2025-01-28	Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics	Guillaume Le Mailloux et.al.	2501.17107	link
2025-01-28	DataLens: ML-Oriented Interactive Tabular Data Quality Dashboard	Mohamed Abdelaal et.al.	2501.17074	null
2025-01-28	Generative diffusion models from a PDE perspective	Fei Cao et.al.	2501.17054	null
2025-01-28	MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition	Philippe Pasquier et.al.	2501.17011	null
2025-01-28	Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver	Shunya Minami et.al.	2501.16986	null
2025-01-28	A totally non-compensatory multi-criteria method for evaluating and improving level of satisfaction (LoS): proposal and application on Airport Terminal of Passengers	Phelipe Medeiros da Rocha et.al.	2501.16979	null
2025-01-28	Adversarial Masked Autoencoder Purifier with Defense Transferability	Yuan-Chih Chen et.al.	2501.16904	null
2025-01-28	Extending Information Bottleneck Attribution to Video Sequences	Veronika Solopova et.al.	2501.16889	link
2025-01-28	DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model	Josua Spisak et.al.	2501.16800	null
2025-01-28	Algorithm for Automatic Legislative Text Consolidation	Matias Etcheverry et.al.	2501.16794	null
2025-01-28	Exponential Family Attention	Kevin Christian Wibisono et.al.	2501.16790	link
2025-01-28	FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation	Arvin Tashakori et.al.	2501.16778	null
2025-01-27	RelightVid: Temporal-Consistent Diffusion Model for Video Relighting	Ye Fang et.al.	2501.16330	null
2025-01-27	Movement- and Traffic-based User Identification in Commercial Virtual Reality Applications: Threats and Opportunities	Sara Baldoni et.al.	2501.16326	link
2025-01-27	Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology	Meiyun Cao et.al.	2501.16309	null
2025-01-27	RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval	Long Nguyen et.al.	2501.16303	null
2025-01-27	Congested Crossing Pedestrian Traffic Flow: Dispersion vs. Transport in Crowded Areas	Mariam Al Khatib et.al.	2501.16275	null
2025-01-27	Improving DBMS Scheduling Decisions with Fine-grained Performance Prediction on Concurrent Queries -- Extended	Ziniu Wu et.al.	2501.16256	null
2025-01-27	A foundation model for human-AI collaboration in medical literature mining	Zifeng Wang et.al.	2501.16255	null
2025-01-27	UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images	Tatiana Taís Schein et.al.	2501.16211	link
2025-01-27	HERITRACE: A User-Friendly Semantic Data Editor with Change Tracking and Provenance Management for Cultural Heritage Institutions	Arcangelo Massari et.al.	2501.16197	null
2025-01-27	Multi-front dynamics in spatially inhomogeneous Allen-Cahn equations	Robbin Bastiaansen et.al.	2501.16195	null
2025-01-27	BAG: Body-Aligned 3D Wearable Asset Generation	Zhongjin Luo et.al.	2501.16177	null
2025-01-27	Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors	Zhiyuan Lu et.al.	2501.16147	null
2025-01-27	Disruption-aware Microservice Re-orchestration for Cost-efficient Multi-cloud Deployments	Marco Zambianco et.al.	2501.16143	null
2025-01-27	Using Generative Models to Produce Realistic Populations of UK Windstorms	Yee Chun Tsoi et.al.	2501.16110	null
2025-01-27	ARFlow: Autogressive Flow with Hybrid Linear Attention	Mude Hui et.al.	2501.16085	null
2025-01-24	An Attentive Graph Agent for Topology-Adaptive Cyber Defence	Ilya Orson Sandoval et.al.	2501.14700	link
2025-01-24	Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning	Jisi Zhang et.al.	2501.14680	null
2025-01-24	End-to-end workflow for machine learning-based qubit readout with QICK and hls4ml	Giuseppe Di Guglielmo et.al.	2501.14663	null
2025-01-24	Towards Scalable Topological Regularizers	Hiu-Tung Wong et.al.	2501.14641	null
2025-01-24	Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data	Jordi Abante et.al.	2501.14615	null
2025-01-24	Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection	Viktor Kozák et.al.	2501.14587	null
2025-01-24	Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.*	Ludovica Schaerf et.al.	2501.14524	null
2025-01-24	Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design	Taehan Kim et.al.	2501.14469	null
2025-01-24	CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios	Michael Fuest et.al.	2501.14426	null
2025-01-24	DeepFlow: Serverless Large Language Model Serving at Scale	Junhao Hu et.al.	2501.14417	null
2025-01-24	Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches	Ziad Sakr et.al.	2501.14366	null
2025-01-24	Advancing data-driven broadband seismic wavefield simulation with multi-conditional diffusion model	Zhengfa Bi et.al.	2501.14348	null
2025-01-24	HorNets: Learning from Discrete and Continuous Signals with Routing Neural Networks	Boshko Koloski et.al.	2501.14346	link
2025-01-24	Stochastic Method for Delayed Neutron Precursors Transport in Liquid Fuel	Mathis Caprais et.al.	2501.14332	null
2025-01-24	PAID: A Framework of Product-Centric Advertising Image Design	Hongyu Chen et.al.	2501.14316	null
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	Binary Diffusion Probabilistic Model	Vitaliy Kinakh et.al.	2501.13915	null
2025-01-23	Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models	Linh Tran et.al.	2501.13904	null
2025-01-23	A RAG-Based Institutional Assistant	Gustavo Kuratomi et.al.	2501.13880	null
2025-01-23	Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction	Zhi Sheng et.al.	2501.13794	null
2025-01-23	An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem	Mingzhao Wang et.al.	2501.13767	link
2025-01-23	A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation	Dario Serez et.al.	2501.13718	null
2025-01-23	YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID	Iñaki Erregue et.al.	2501.13710	link
2025-01-23	Training-Free Consistency Pipeline for Fashion Repose	Potito Aghilar et.al.	2501.13692	null
2025-01-23	A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification	Younes Yousef et.al.	2501.13598	link
2025-01-23	Funnelling super-resolution STED microscopy through multimode fibres	André Gomes et.al.	2501.13572	null
2025-01-24	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	link
2025-01-23	Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse	Wenzhuo Ma et.al.	2501.13528	null
2025-01-23	LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation	JiaXin Chen et.al.	2501.13475	null
2025-01-22	Accelerate High-Quality Diffusion Models with Inner Loop Feedback	Matthew Gwilliam et.al.	2501.13107	null
2025-01-22	Robust Representation Consistency Model via Contrastive Denoising	Jiachen Lei et.al.	2501.13094	link
2025-01-22	Innovative Web Tool for Remote Data Acquisition and Analysis: Customized for SKA Low frequency Beamforming Test Bed LPDA Array at Gauribidanur Radio Observatory	Anumanchi Agastya Sai Ram Likhit et.al.	2501.13090	null
2025-01-22	Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation	Akshay Krishnan et.al.	2501.13087	null
2025-01-22	Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices	Lianrui Zuo et.al.	2501.13071	null
2025-01-22	Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models	Lianrui Zuo et.al.	2501.13068	null
2025-01-22	Neural network enhanced cross entropy benchmark for monitored circuits	Yangrui Hu et.al.	2501.13005	null
2025-01-22	Low-dimensional adaptation of diffusion models: Convergence in total variation	Jiadong Liang et.al.	2501.12982	null
2025-01-22	Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs	Jan Corazza et.al.	2501.12972	null
2025-01-22	Observation of Strong Nonreciprocal Thermal Emission	Zhenong Zhang et.al.	2501.12947	null
2025-01-22	3D Object Manipulation in a Single Image using Generative Models	Ruisi Zhao et.al.	2501.12935	null
2025-01-22	Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization	Xu Yang et.al.	2501.12881	null
2025-01-22	CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation	Xianglong Shi et.al.	2501.12860	null
2025-01-22	AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation	Aghiles Kebaili et.al.	2501.12840	null
2025-01-22	Inverse Design of Chiral Structures for Giant Helical Dichroism	Chia-Chun Pan et.al.	2501.12825	null
2025-01-21	Towards Affordance-Aware Articulation Synthesis for Rigged Objects	Yu-Chu Yu et.al.	2501.12393	null
2025-01-22	GPS as a Control Signal for Image Generation	Chao Feng et.al.	2501.12390	null
2025-01-21	Audio Texture Manipulation by Exemplar-Based Analogy	Kan Jen Cheng et.al.	2501.12385	null
2025-01-21	Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks	Greg Olmschenk et.al.	2501.12383	null
2025-01-21	DiffDoctor: Diagnosing Image Diffusion Models Before Treating	Yiyang Wang et.al.	2501.12382	null
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	FuocChuVIP123 at CoMeDi Shared Task: Disagreement Ranking with XLM-Roberta Sentence Embeddings and Deep Neural Regression	Phuoc Duong Huy Chu et.al.	2501.12336	null
2025-01-21	VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models	Chaohao Xie et.al.	2501.12267	null
2025-01-21	Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework	Antoine De Paepe et.al.	2501.12249	null
2025-01-21	InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models	Pha Nguyen et.al.	2501.12231	null
2025-01-21	TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space	Daniel Garibi et.al.	2501.12224	null
2025-01-21	Early Detection and Classification of Breast Cancer Using Deep Learning Techniques	Mst. Mumtahina Labonno et.al.	2501.12217	null
2025-01-22	Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Zibo Zhao et.al.	2501.12202	link
2025-01-21	An End-to-End Approach for Korean Wakeword Systems with Speaker Authentication	Geonwoo Seo et.al.	2501.12194	link
2025-01-21	ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions	Shiyue Zhang et.al.	2501.12173	link
2025-01-17	Zero-Shot Monocular Scene Flow Estimation in the Wild	Yiqing Liang et.al.	2501.10357	null
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	link
2025-01-17	DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration	Huiyun Cao et.al.	2501.10325	null
2025-01-17	SEANN: A Domain-Informed Neural Network for Epidemiological Insights	Jean-Baptiste Guimbaud et.al.	2501.10273	null
2025-01-17	Drift time calibration of the ultra-Low material budget GEM-based TPC for MIXE	X. Zhao et.al.	2501.10249	null
2025-01-17	Over-the-Air Multi-Sensor Inference with Neural Networks Using Memristor-Based Analog Computing	Busra Tegin et.al.	2501.10245	null
2025-01-17	Modelling Activity Scheduling Behaviour with Deep Generative Machine Learning	Fred Shone et.al.	2501.10221	null
2025-01-17	Adaptive Clustering for Efficient Phenotype Segmentation of UAV Hyperspectral Data	Ciem Cornelissen et.al.	2501.10199	null
2025-01-17	Optimizing Structured-Sparse Matrix Multiplication in RISC-V Vector Processors	Vasileios Titopoulos et.al.	2501.10189	null
2025-01-17	Convex Physics Informed Neural Networks for the Monge-Ampère Optimal Transport Problem	Alexandre Caboussat et.al.	2501.10162	null
2025-01-17	AI-Generated Music Detection and its Challenges	Darius Afchar et.al.	2501.10111	link
2025-01-17	DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency	Xiaohui Li et.al.	2501.10110	null
2025-01-17	landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images	Jef Jonkers et.al.	2501.10098	link
2025-01-17	Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning	Shengkui Zhao et.al.	2501.10052	link
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-01-16	SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces	Sumit Chaturvedi et.al.	2501.09756	null
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	null
2025-01-16	KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports	Hajung Kim et.al.	2501.09744	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text	Jihed Ncib et.al.	2501.09719	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	A Survey of Research in Large Language Models for Electronic Design Automation	Jingyu Pan et.al.	2501.09655	null
2025-01-16	Fabrication of Mode-Matched, Low-Loss Optical Resonators by Combination of FIB-Milling and CO$_2$ Laser Ablation	Patrick Maier et.al.	2501.09577	null
2025-01-16	AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation	Junjie He et.al.	2501.09503	link
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	"A Great Start, But...": Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design	Tianhao He et.al.	2501.09457	null
2025-01-16	CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation	Hwan Heo et.al.	2501.09433	link
2025-01-16	Towards a Framework for Enterprise Architecture in Mobile Government: A Case Study	Son Pham et.al.	2501.09401	null
2025-01-16	Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse	Guangyuan Liu et.al.	2501.09391	null
2025-01-16	Identification of Traditional Medicinal Plant Leaves Using an effective Deep Learning model and Self-Curated Dataset	Deepjyoti Chetia et.al.	2501.09363	null
2025-01-15	How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias	Tosin Fadahunsi et.al.	2501.09014	link
2025-01-15	SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation	Aditya Bhat et.al.	2501.09008	null
2025-01-15	CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks	Krit Tangsongcharoen et.al.	2501.08998	link
2025-01-15	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	RepVideo: Rethinking Cross-Layer Representation for Video Generation	Chenyang Si et.al.	2501.08994	null
2025-01-15	CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities	Haozhe Xie et.al.	2501.08983	link
2025-01-15	Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models	Karukriti Kaushik Ghosh et.al.	2501.08974	null
2025-01-15	Karatsuba Matrix Multiplication and its Efficient Custom Hardware Implementations	Trevor E. Pogue et.al.	2501.08889	link
2025-01-15	Connecting SPDE to SGMs	Junsu Seo et.al.	2501.08877	null
2025-01-16	Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts	Antonio Castellanos et.al.	2501.08869	null
2025-01-15	Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution	Shao-Hao Lu et.al.	2501.08819	link
2025-01-15	Securities Transaction Settlement Optimization on superconducting quantum devices	Francesco Martini et.al.	2501.08794	null
2025-01-15	Near-Field ISAC: Synergy of Dual-Purpose Codebooks and Space-Time Adaptive Processing	Ahmed Hussain et.al.	2501.08776	null
2025-01-15	Adaptive Approximation Schemes for Matching Queues	Alireza AmaniHamedani et.al.	2501.08775	null
2025-01-15	An Ultra-Wideband Dual Polarization Antenna Array for the Detection and Localization of Bright Fast Radio Transients in the Milky Way	Diego Gallardo et.al.	2501.08764	null
2025-01-14	DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models	Hyeonwoo Kim et.al.	2501.08333	null
2025-01-14	MangaNinja: Line Art Colorization with Precise Reference Following	Zhiheng Liu et.al.	2501.08332	null
2025-01-14	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-14	GameFactory: Creating New Games with Generative Interactive Videos	Jiwen Yu et.al.	2501.08325	null
2025-01-14	Diffusion Adversarial Post-Training for One-Step Video Generation	Shanchuan Lin et.al.	2501.08316	null
2025-01-14	LayerAnimate: Layer-specific Control for Animation	Yuxue Yang et.al.	2501.08295	null
2025-01-14	HALoGEN: Fantastic LLM Hallucinations and Where to Find Them	Abhilasha Ravichander et.al.	2501.08292	null
2025-01-14	FDPP: Fine-tune Diffusion Policy with Human Preference	Yuxin Chen et.al.	2501.08259	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps	Kannan Parthasarathy et.al.	2501.08243	null
2025-01-14	CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset	Jiawei Du et.al.	2501.08238	null
2025-01-14	FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Yabo Zhang et.al.	2501.08225	link
2025-01-14	D$^2$-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models	Qian Zeng et.al.	2501.08180	link
2025-01-14	DM-Mamba: Dual-domain Multi-scale Mamba for MRI reconstruction	Yucong Meng et.al.	2501.08163	link
2025-01-14	Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data	Phai Vu Dinh et.al.	2501.08149	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection	Shiman Zhang et.al.	2501.07533	link
2025-01-13	IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion	Tharun Anand et.al.	2501.07530	null
2025-01-13	LitmusKt: Concurrency Stress Testing for Kotlin	Denis Lochmelis et.al.	2501.07472	link
2025-01-13	PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations	Ting-Yu Dai et.al.	2501.07447	null
2025-01-13	Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation	Xiyue Zhu et.al.	2501.07430	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction	Lukas Glaszner et.al.	2501.07376	link
2025-01-13	Simulating the Hubbard Model with Equivariant Normalizing Flows	Dominic Schuh et.al.	2501.07371	null
2025-01-13	Multimodal semantic retrieval for product search	Dong Liu et.al.	2501.07365	null
2025-01-13	Predicting System Dynamics of Universal Growth Patterns in Complex Systems	Leila Hedayatifar et.al.	2501.07349	null
2025-01-13	The Spectrum of C/2023 A3 Indicates A Depleted Composition	Yunyi Tang et.al.	2501.07340	null
2025-01-13	Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring	Buse Sibel Korkmaz et.al.	2501.07324	link
2025-01-13	ViewVR: Visual Feedback Modes to Achieve Quality of VR-based Telemanipulation	A. Erkhov et.al.	2501.07299	link
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-10	ScooterLab: A Programmable and Participatory Sensing Research Testbed using Micromobility Vehicles	Ubaidullah Khan et.al.	2501.06177	null
2025-01-10	VideoAuteur: Towards Long Narrative Video Generation	Junfei Xiao et.al.	2501.06173	null
2025-01-10	GenMol: A Drug Discovery Generalist with Discrete Diffusion	Seul Lee et.al.	2501.06158	null
2025-01-10	From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training	Julius Berner et.al.	2501.06148	link
2025-01-10	The interplay of user preference and precision in different gaze-based interaction methods	Björn Rene Severitt et.al.	2501.06073	null
2025-01-10	Photokinetics of Photothermal Reactions	Mounir Maafi et.al.	2501.06057	null
2025-01-10	Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction	Cecilia Curreli et.al.	2501.06035	null
2025-01-10	Resiliency metrics quantifying emergency response in a distribution system	Shikhar Pandey et.al.	2501.06030	null
2025-01-10	RPKI-Based Location-Unaware Tor Guard Relay Selection Algorithms	Zhifan Lu et.al.	2501.06010	link
2025-01-10	CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control	Stefan Popov et.al.	2501.06006	null
2025-01-10	Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory	Yunmeng Shu et.al.	2501.05965	null
2025-01-10	Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion	Michal Švento et.al.	2501.05959	link
2025-01-10	DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information	Yongfan Lai et.al.	2501.05932	link
2025-01-10	Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation	Minxing Luo et.al.	2501.05892	null
2025-01-10	Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models	Sofia Jamil et.al.	2501.05839	link
2025-01-09	Decentralized Diffusion Models	David McAllister et.al.	2501.05450	null
2025-01-09	Consistent Flow Distillation for Text-to-3D Generation	Runjie Yan et.al.	2501.05445	null
2025-01-09	Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces	Aniruddha Mahapatra et.al.	2501.05442	null
2025-01-09	The GAN is dead; long live the GAN! A Modern GAN Baseline	Yiwen Huang et.al.	2501.05441	link
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation	Darius Petermann et.al.	2501.05413	null
2025-01-09	TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts	Yu-Hao Huang et.al.	2501.05403	link
2025-01-09	Integrating Explainable AI for Effective Malware Detection in Encrypted Network Traffic	Sileshi Nibret Zeleke et.al.	2501.05387	null
2025-01-09	Accelerated Diffusion Models via Speculative Sampling	Valentin De Bortoli et.al.	2501.05370	null
2025-01-09	CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models	Junha Park et.al.	2501.05359	null
2025-01-09	Video-Conferencing Beyond Screen-Sharing and Thumbnail Webcam Videos: Gesture-Aware Augmented Reality Video for Data-Rich Remote Presentations	Matthew Brehmer et.al.	2501.05345	null
2025-01-09	The Bakers and Millers Game with Restricted Locations	Simon Krogmann et.al.	2501.05334	null
2025-01-09	Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal	Wanli Ma et.al.	2501.05265	null
2025-01-09	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	link
2025-01-09	A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education	Ziqing Li et.al.	2501.05220	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	null
2025-01-08	ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning	Yuzhou Huang et.al.	2501.04698	null
2025-01-08	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	Zixuan Huang et.al.	2501.04689	null
2025-01-08	URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics	Ruilin Luo et.al.	2501.04686	link
2025-01-08	Integrating IPbus ALFRED into the ALICE-FIT setup	Krystian Roslon et.al.	2501.04685	null
2025-01-08	Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations	Archita Srivastava et.al.	2501.04675	null
2025-01-08	A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI	Kazusato Oko et.al.	2501.04641	link
2025-01-08	Knowledge Retrieval Based on Generative AI	Te-Lun Yang et.al.	2501.04635	null
2025-01-08	Disentangled Clothed Avatar Generation with Layered Representation	Weitian Zhang et.al.	2501.04631	null
2025-01-09	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-08	Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion	Yangfan He et.al.	2501.04606	link
2025-01-08	Understanding Expectations for a Robotic Guide Dog for Visually Impaired People	J. Taery Kim et.al.	2501.04594	null
2025-01-08	Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time	Uri Berger et.al.	2501.04513	null
2025-01-08	Simultaneous MOKE imaging and measurement of magneto-resistance with vector magnet: a low noise customized setup for low field magnetic devices and thin films characterization	Imtiaz Noor Bhatti et.al.	2501.04431	null
2025-01-08	End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach	H. M. Shadman Tabib et.al.	2501.04425	null
2025-01-07	WAPTS: A Weighted Allocation Probability Adjusted Thompson Sampling Algorithm for High-Dimensional and Sparse Experiment Settings	Haochen Song et.al.	2501.03999	null
2025-01-07	Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance	Adil Rengim Cetingoz et.al.	2501.03993	null
2025-01-07	NeuralSVG: An Implicit Representation for Text-to-Vector Generation	Sagi Polaczek et.al.	2501.03992	null
2025-01-07	Stabilising effect of generic anomalous diffusion independent of the Rayleigh number	Antonio Barletta et.al.	2501.03990	null
2025-01-07	Synthetic Data Privacy Metrics	Amy Steier et.al.	2501.03941	null
2025-01-07	Visual question answering: from early developments to recent advances -- a survey	Ngoc Dung Huynh et.al.	2501.03939	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers	Yuechen Zhang et.al.	2501.03931	link
2025-01-07	HYB-VITON: A Hybrid Approach to Virtual Try-On Combining Explicit and Implicit Warping	Kosuke Takemoto et.al.	2501.03910	link
2025-01-07	mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training	Xudong Liao et.al.	2501.03905	null
2025-01-07	Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants	Philip Weber et.al.	2501.03862	null
2025-01-07	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	link
2025-01-07	Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging	Simon W. Penninga et.al.	2501.03825	null
2025-01-07	Impact of diffusion mechanisms on persistence and spreading	Nathanaël Boutillon et.al.	2501.03816	null
2025-01-07	Private, Auditable, and Distributed Ledger for Financial Institutes	Shaltiel Eloul et.al.	2501.03808	link
2025-01-06	MObI: Multimodal Object Inpainting Using Diffusion Models	Alexandru Buburuzan et.al.	2501.03173	null
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models	Mehmet Onurcan Kaya et.al.	2501.03030	link
2025-01-06	TransPixar: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution	Rui Xie et.al.	2501.02976	null
2025-01-06	Leader Rotation Is Not Enough: Scrutinizing Leadership Democracy of Chained BFT Consensus	Yining Tang et.al.	2501.02970	null
2025-01-07	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	null
2025-01-06	Inhibition of bacterial growth by antibiotics	Barnabe Ledoux et.al.	2501.02944	null
2025-01-06	Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions	Jianhua Pei et.al.	2501.02928	null
2025-01-06	Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis	Thang-Anh-Quan Nguyen et.al.	2501.02913	null
2025-01-06	Sim-to-Real Transfer for Mobile Robots with Reinforcement Learning: from NVIDIA Isaac Sim to Gazebo and Real ROS 2 Robots	Sahar Salimpour et.al.	2501.02902	link
2025-01-06	Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems	Shayan Mohajer Hamidi et.al.	2501.02880	null
2025-01-06	Towards HRTF Personalization using Denoising Diffusion Models	Juan Camilo Albarracín Sánchez et.al.	2501.02871	null
2025-01-07	Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans	Rezkellah Noureddine Khiati et.al.	2501.02867	null
2025-01-06	Large Language Models for Video Surveillance Applications	Ulindu De Silva et.al.	2501.02850	null
2025-01-03	Metadata Conditioning Accelerates Language Model Pre-training	Tianyu Gao et.al.	2501.01956	link
2025-01-03	MADGEN -- Mass-Spec attends to De Novo Molecular generation	Yinkai Wang et.al.	2501.01950	link
2025-01-03	Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models	Manh Duong Nguyen et.al.	2501.01932	link
2025-01-03	EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation	Siyuan Huang et.al.	2501.01895	null
2025-01-03	Exploring Equality: An Investigation into Custom Loss Functions for Fairness Definitions	Gordon Lee et.al.	2501.01889	null
2025-01-03	LCFed: An Efficient Clustered Federated Learning Framework for Heterogeneous Data	Yuxin Zhang et.al.	2501.01850	null
2025-01-03	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation	Mohammad Khalil et.al.	2501.01793	link
2025-01-03	Ingredients: Blending Custom Photos with Video Diffusion Transformers	Zhengcong Fei et.al.	2501.01790	link
2025-01-03	Nonparametric estimation of a factorizable density using diffusion models	Hyeok Kyu Kwon et.al.	2501.01783	null
2025-01-03	Customizing pseudospin unidirectional states of acoustic and electromagnetic waves in two-dimensional phoxonic topological insulators via multi-objective strategies	Gang-Gang Xu et.al.	2501.01766	null
2025-01-03	Constrained Pricing in Choice-based Revenue Management	Qian Shao et.al.	2501.01764	null
2025-01-03	Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models	Andrea Matteazzi et.al.	2501.01761	null
2025-01-03	MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling	Simon Rouard et.al.	2501.01757	null
2025-01-03	Combined Hyper-Extensible Extremely-Secured Zero-Trust CIAM-PAM architecture	Shivom Aggarwal et.al.	2501.01732	null
2025-01-02	Object-level Visual Prompts for Compositional Image Generation	Gaurav Parmar et.al.	2501.01424	null
2025-01-02	Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models	Jingfeng Yao et.al.	2501.01423	link
2025-01-02	Multi-Modal Video Feature Extraction for Popularity Prediction	Haixu Liu et.al.	2501.01422	null
2025-01-02	Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers	Seunghyun Lee et.al.	2501.01414	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement	Z. Zhang et.al.	2501.01368	null
2025-01-02	Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation	Nathaniel Dennler et.al.	2501.01367	null
2025-01-03	Conditional Consistency Guided Image Translation and Enhancement	Amil Bhagat et.al.	2501.01223	link
2025-01-03	TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jiayu Li et.al.	2501.01216	null
2025-01-02	Range-Only Localization System for Small-Scale Flapping-Wing Robots	Raul Tapia et.al.	2501.01213	link
2025-01-02	LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge	Kyoungkook Kang et.al.	2501.01197	null
2025-01-02	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2025-01-02	Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission	Maojun Zhang et.al.	2501.01138	link
2025-01-02	Co-Design of a Robot Controller Board and Indoor Positioning System for IoT-Enabled Applications	Ali Safa et.al.	2501.01115	null
2025-01-02	MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification	Jimin Park et.al.	2501.01110	link
2024-12-30	The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick	Jonathan Berkheim et.al.	2412.21186	null
2024-12-30	Unified dimensionality reduction techniques in chronic liver disease detection	Anand Karna et.al.	2412.21156	null
2025-01-02	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	null
2024-12-30	Impact of Fourth Industrial Revolution (4IR) on Small and Medium Enterprises (SMEs) and Employment in Bangladesh: Opportunities and Challenges	Toukir Ahammed et.al.	2412.21106	null
2024-12-30	Quantum Diffusion Model for Quark and Gluon Jet Generation	Mariia Baidachna et.al.	2412.21082	link
2025-01-02	Edicho: Consistent Image Editing in the Wild	Qingyan Bai et.al.	2412.21079	link
2024-12-30	Varformer: Adapting VAR's Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-30	VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Jiazheng Xu et.al.	2412.21059	link
2024-12-30	E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models	Zhiyu Tan et.al.	2412.21044	null
2024-12-30	Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration	Wanglong Lu et.al.	2412.21042	link
2024-12-30	TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization	Chia-Yu Hung et.al.	2412.21037	link
2024-12-30	Verified Lifting of Deep learning Operators	Qi Zhan et.al.	2412.20992	null
2024-12-30	AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies	Yibo Wen et.al.	2412.20984	null
2024-12-30	AGON: Automated Design Framework for Customizing Processors from ISA Documents	Chongxiao Li et.al.	2412.20954	null
2024-12-30	AI-Supported Data Analysis Boosts Student Motivation and Reduces Stress in Physics Education	Jannik Henze et.al.	2412.20951	null
2024-12-27	Tensor Network Estimation of Distribution Algorithms	John Gardiner et.al.	2412.19780	null
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-27	Complement or substitute? How AI increases the demand for human skills	Elina Mäkelä et.al.	2412.19754	null
2024-12-27	Text2Insight: Transform natural language text into insights seamlessly using multi-model architecture	Pradeep Sain et.al.	2412.19718	null
2024-12-27	From Elements to Design: A Layered Approach for Automatic Graphic Design Composition	Jiawei Lin et.al.	2412.19712	null
2024-12-27	An Integrated Optimization and Deep Learning Pipeline for Predicting Live Birth Success in IVF Using Feature Optimization and Transformer-Based Models	Arezoo Borji et.al.	2412.19696	null
2024-12-27	From prediction to explanation: managing influential negative reviews through explainable AI	Rongping Shen et.al.	2412.19692	null
2024-12-27	VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models	Tao Wu et.al.	2412.19645	null
2024-12-27	Diverse Rare Sample Generation with Pretrained GANs	Subeen Lee et.al.	2412.19543	link
2024-12-27	Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning	Xuan Zhou et.al.	2412.19538	null
2024-12-27	StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture	Miaomiao Dai et.al.	2412.19535	null
2024-12-27	Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations	Yuanfei Huang et.al.	2412.19520	link
2024-12-27	Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model	Hyunwoo Cho et.al.	2412.19517	null
2024-12-27	DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT	Xiaotao Hu et.al.	2412.19505	link
2024-12-27	RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model	Xiaohan Zhang et.al.	2412.19500	link
2024-12-24	PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models	Minghao Chen et.al.	2412.18608	null
2024-12-24	DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers	Yuntao Chen et.al.	2412.18607	null
2024-12-24	Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models	Tahira Kazimi et.al.	2412.18604	null
2024-12-24	Long-Form Speech Generation with Spoken Language Models	Se Jin Park et.al.	2412.18603	link
2024-12-24	ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Hongjie Li et.al.	2412.18600	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-24	LatentCRF: Continuous CRF for Efficient Latent Diffusion	Kanchana Ranasinghe et.al.	2412.18596	null
2024-12-24	Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation	Anselm Krainovic et.al.	2412.18584	null
2024-12-24	3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement	Yihang Luo et.al.	2412.18565	null
2024-12-24	Elevating Information System Performance: A Deep Dive into Quality Metrics	Dana A Abdullah et.al.	2412.18512	null
2024-12-24	A region-wide, multi-year set of crop field boundary labels for Africa	L. D. Estes et.al.	2412.18483	null
2024-12-24	GeFL: Model-Agnostic Federated Learning with Generative Models	Honggu Kang et.al.	2412.18460	null
2024-12-24	Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm	O. Deniz Akyildiz et.al.	2412.18432	null
2024-12-24	Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models	Qice Qin et.al.	2412.18421	null
2024-12-24	Discovery of 2D Materials via Symmetry-Constrained Diffusion Model	Shihang Xu et.al.	2412.18414	null
2024-12-23	FaceLift: Single Image to 3D Head with View Generation and GS-LRM	Weijie Lyu et.al.	2412.17812	null
2024-12-23	PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion	Sophia Tang et.al.	2412.17780	null
2024-12-23	The Superposition of Diffusion Models Using the Itô Density Estimator	Marta Skreta et.al.	2412.17762	null
2024-12-23	Superconductivity in Nanosystems: A Fruitful Path to New Phenomenology in Quantum Materials	M. V. Ramallo et.al.	2412.17722	null
2024-12-23	A Bias-Free Training Paradigm for More General AI-generated Image Detection	Fabrizio Guillaro et.al.	2412.17671	null
2024-12-23	Benchmarking Generative AI Models for Deep Learning Test Input Generation	Maryam et.al.	2412.17652	link
2024-12-23	DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder	Ente Lin et.al.	2412.17644	null
2024-12-23	ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance	Renyang Liu et.al.	2412.17632	link
2024-12-23	Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models	Parham Rezaei et.al.	2412.17622	link
2024-12-23	Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor	Yeonju Kim et.al.	2412.17572	null
2024-12-23	The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning	Shentong Mo et.al.	2412.17566	null
2024-12-23	S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field	Zixi Liang et.al.	2412.17561	link
2024-12-23	Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing	Prakash Aryan et.al.	2412.17548	link
2024-12-23	Retention Score: Quantifying Jailbreak Risks for Vision Language Models	Zaitang Li et.al.	2412.17544	null
2024-12-23	CiteBART: Learning to Generate Citations for Local Citation Recommendation	Ege Yiğit Çelik et.al.	2412.17534	link
2024-12-20	Personalized Representation from Personalized Generation	Shobhita Sundaram et.al.	2412.16156	link
2024-12-20	Can Generative Video Models Help Pose Estimation?	Ruojin Cai et.al.	2412.16155	null
2024-12-20	FedGAT: A Privacy-Preserving Federated Approximation Algorithm for Graph Attention Networks	Siddharth Ambekar et.al.	2412.16144	null
2024-12-20	NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems	Laura Weihl et.al.	2412.16141	null
2024-12-20	Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli	Lucila G. Alvarez-Zuzek et.al.	2412.16121	null
2024-12-20	Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation	Timur Sattarov et.al.	2412.16083	null
2024-12-20	Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy	Shaoyan Pan et.al.	2412.16050	null
2024-12-20	SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation	Jiadong Pan et.al.	2412.16039	null
2024-12-20	Electric Vehicle Charging Stations Placement Optimization in Vietnam Using Mixed-Integer Nonlinear Programming Model	Quynh Vu Truc et.al.	2412.16025	link
2024-12-20	Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling	Maximillian Chen et.al.	2412.15995	null
2024-12-20	Optimization of Beyond Diagonal RIS: A Universal Framework Applicable to Arbitrary Architectures	Zheyu Wu et.al.	2412.15965	null
2024-12-20	Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation	Gautier Evennou et.al.	2412.15939	link
2024-12-20	RiTTA: Modeling Event Relations in Text-to-Audio Generation	Yuhang He et.al.	2412.15922	link
2024-12-20	Less is More: Towards Green Code Large Language Models via Unified Structural Pruning	Guang Yang et.al.	2412.15921	null
2024-12-20	Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation	Kai Brandenbusch et.al.	2412.15853	null
2024-12-19	LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis	Hanlin Wang et.al.	2412.15214	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	null
2024-12-19	Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation	Hadi Alzayer et.al.	2412.15211	null
2024-12-19	AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation	Moayed Haji-Ali et.al.	2412.15191	null
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	SqueezeMe: Efficient Gaussian Avatars for VR	Shunsuke Saito et.al.	2412.15171	null
2024-12-19	OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization	Jiacheng Zhang et.al.	2412.15159	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Jet: A Modern Transformer-Based Normalizing Flow	Alexander Kolesnikov et.al.	2412.15129	null
2024-12-19	Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation	Yang Tian et.al.	2412.15109	link
2024-12-19	Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation	Haoran Liu et.al.	2412.15086	null
2024-12-19	Eigenstate Preparation on Quantum Computers	Joey Bonitati et.al.	2412.15081	null
2024-12-19	Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion	Zhifei Chen et.al.	2412.15050	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	link
2024-12-18	AniDoc: Animation Creation Made Easier	Yihao Meng et.al.	2412.14173	null
2024-12-19	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	null
2024-12-18	MetaMorph: Multimodal Understanding and Generation via Instruction Tuning	Shengbang Tong et.al.	2412.14164	null
2024-12-18	MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation	Shenhao Zhu et.al.	2412.14148	null
2024-12-18	Event-based Photometric Bundle Adjustment	Shuang Guo et.al.	2412.14111	link
2024-12-18	Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report	Markus Dablander et.al.	2412.14085	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-18	Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates	Sen Yan et.al.	2412.13966	null
2024-12-18	A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI	Beiduo Chen et.al.	2412.13942	link
2024-12-18	Development of a High-Resolution, High-Dynamic-Range Charge Detector for Ion Beam Monitoring	O. Adriani et.al.	2412.13934	null
2024-12-18	Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech	Joanna Reszka et.al.	2412.13933	null
2024-12-18	Graph-Driven Models for Gas Mixture Identification and Concentration Estimation on Heterogeneous Sensor Array Signals	Ding Wang et.al.	2412.13891	null
2024-12-18	Navigating limitations with precision: A fine-grained ensemble approach to wrist pathology recognition on a limited x-ray dataset	Ammar Ahmed et.al.	2412.13884	null
2024-12-17	CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models	Gaoyang Zhang et.al.	2412.13195	link
2024-12-17	StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models	Yunzhi Yan et.al.	2412.13188	null
2024-12-17	Move-in-2D: 2D-Conditioned Human Motion Generation	Hsin-Ping Huang et.al.	2412.13185	null
2024-12-17	F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration	Lu Liu et.al.	2412.13155	null
2024-12-17	Prompt Augmentation for Self-supervised Text-guided Image Manipulation	Rumeysa Bodur et.al.	2412.13081	null
2024-12-17	3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation	Haoshen Wang et.al.	2412.13059	null
2024-12-17	Guiding Generative Protein Language Models with Reinforcement Learning	Filippo Stocco et.al.	2412.12979	link
2024-12-18	Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance	Wenhao Sun et.al.	2412.12974	link
2024-12-17	ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting	Guillaume Couairon et.al.	2412.12971	link
2024-12-17	Modified UNIFAC 2.0 -- A Group-Contribution Method Completed with Machine Learning	Nicolas Hayer et.al.	2412.12962	null
2024-12-17	MOPO: Multi-Objective Prompt Optimization for Affective Text Generation	Yarik Menchaca Resendiz et.al.	2412.12948	null
2024-12-17	Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence	Johannes Martin et.al.	2412.12923	null
2024-12-17	Unsupervised Region-Based Image Editing of Denoising Diffusion Models	Zixiang Li et.al.	2412.12912	null
2024-12-18	ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction	Zhongjie Duan et.al.	2412.12888	link
2024-12-17	Memory-minimal quantum generation of stochastic processes: spectral invariants of quantum hidden Markov models	Magdalini Zonnios et.al.	2412.12812	null
2024-12-16	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095	link
2024-12-16	CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models	Felix Taubner et.al.	2412.12093	null
2024-12-16	Wonderland: Navigating 3D Scenes from a Single Image	Hanwen Liang et.al.	2412.12091	null
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-16	LLMs for Cold-Start Cutting Plane Separator Configuration	Connor Lawless et.al.	2412.12038	link
2024-12-16	Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps	Linfeng Zhao et.al.	2412.12024	null
2024-12-16	The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation	Gilles Mordant et.al.	2412.12007	null
2024-12-16	Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data	Onur Tasar et.al.	2412.11972	null
2024-12-16	The Erdős unit distance problem for small point sets	Boris Alexeev et.al.	2412.11914	null
2024-12-16	CharacterBench: Benchmarking Character Customization of Large Language Models	Jinfeng Zhou et.al.	2412.11912	link
2024-12-16	Towards Understanding Systems Trade-offs in Retrieval-Augmented Generation Model Inference	Michael Shen et.al.	2412.11854	null
2024-12-16	ColorFlow: Retrieval-Augmented Image Sequence Colorization	Junhao Zhuang et.al.	2412.11815	null
2024-12-16	InterDyn: Controllable Interactive Dynamics with Video Diffusion Models	Rick Akkerman et.al.	2412.11785	null
2024-12-16	Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study	Clémentine Phung-Ngoc et.al.	2412.11776	null
2024-12-17	No More Adam: Learning Rate Scaling at Initialization is All You Need	Minghao Xu et.al.	2412.11768	link
2024-12-13	Towards a foundation model for heavy-ion collision experiments through point cloud diffusion	Manjunath Omana Kuttan et.al.	2412.10352	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	null
2024-12-13	Iterating the Transient Light Transport Matrix for Non-Line-of-Sight Imaging	Talha Sultan et.al.	2412.10300	null
2024-12-13	Coherent 3D Scene Diffusion From a Single RGB Image	Manuel Dahnert et.al.	2412.10294	null
2024-12-13	Adversarial Robustness of Bottleneck Injected Deep Neural Networks for Task-Oriented Communication	Alireza Furutanpey et.al.	2412.10265	null
2024-12-13	Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models	Harry J. Davies et.al.	2412.10257	null
2024-12-13	Exploring the Frontiers of Animation Video Generation in the Sora Era: Method, Dataset and Benchmark	Yudong Jiang et.al.	2412.10255	link
2024-12-13	Radiator Tailoring for Enhanced Performance in InAs-Based Near-Field Thermophotovoltaics	Mathieu Giroux et.al.	2412.10217	null
2024-12-13	GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion	Jiapeng Tang et.al.	2412.10209	null
2024-12-13	Efficient Generative Modeling with Residual Vector Quantization-Based Tokens	Jaehyeon Kim et.al.	2412.10208	null
2024-12-13	Simple Guidance Mechanisms for Discrete Diffusion Models	Yair Schiff et.al.	2412.10193	link
2024-12-13	SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models	Hung Nguyen et.al.	2412.10178	null
2024-12-13	Learning payoffs while routing in skill-based queues	Sanne van Kempen et.al.	2412.10168	null
2024-12-13	The Art of Deception: Color Visual Illusions and Diffusion Models	Alex Gomez-Villa et.al.	2412.10122	null
2024-12-13	Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data	Jonas Golde et.al.	2412.10121	link
2024-12-12	FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion	Haonan Qiu et.al.	2412.09626	null
2024-12-12	Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors	Yue Feng et.al.	2412.09625	null
2024-12-12	GenEx: Generating an Explorable World	Taiming Lu et.al.	2412.09624	null
2024-12-12	OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Weiqi Li et.al.	2412.09623	null
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	null
2024-12-12	SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dongting Hu et.al.	2412.09619	null
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	null
2024-12-12	Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG	Kavana Venkatesh et.al.	2412.09614	null
2024-12-13	Olympus: A Universal Task Router for Computer Vision Tasks	Yuanze Lin et.al.	2412.09612	link
2024-12-12	Owl-1: Omni World Model for Consistent Long Video Generation	Yuanhui Huang et.al.	2412.09600	link
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion	Zexin He et.al.	2412.09593	null
2024-12-12	Improving the Reliability of Cable Broadband Networks via Proactive Network Maintenance	Jiyao Hu et.al.	2412.09564	null
2024-12-12	Meshtron: High-Fidelity, Artist-Like 3D Mesh Generation at Scale	Zekun Hao et.al.	2412.09548	null
2024-12-12	SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing	Xueting Li et.al.	2412.09545	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-11	DMin: Scalable Training Data Influence Estimation for Diffusion Models	Huawei Lin et.al.	2412.08637	link
2024-12-11	Multimodal Latent Language Modeling with Next-Token Diffusion	Yutao Sun et.al.	2412.08635	link
2024-12-11	An SDR-Based Monostatic Wi-Fi System with Analog Self-Interference Cancellation for Sensing	Andreas Toftegaard Kristensen et.al.	2412.08612	null
2024-12-12	Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis	Feng Zhou et.al.	2412.08603	null
2024-12-11	TryOffAnyone: Tiled Cloth Generation from a Dressed Person	Ioannis Xarchakos et.al.	2412.08573	link
2024-12-12	Watermarking Training Data of Music Generation Models	Pascal Epple et.al.	2412.08549	null
2024-12-11	Orderly Management of Packets in RDMA by Eunomia	Sana Mahmood et.al.	2412.08540	null
2024-12-11	Ensemble-Based Quantum-Token Protocol Benchmarked on IBM Quantum Processors	Lucas Tsunaki et.al.	2412.08530	link
2024-12-11	Comparative Opinion Mining in Product Reviews: Multi-perspective Prompt-based Learning	Hai-Yen Thi Nguyen et.al.	2412.08508	null
2024-12-11	Open-Loop and Model Predictive Control for Electric Vehicle Charging to Manage Excess Renewable Energy Supply in Texas	Kelsey M. Nelson et.al.	2412.08505	null
2024-12-11	Learning Flow Fields in Attention for Controllable Person Image Generation	Zijian Zhou et.al.	2412.08486	link
2024-12-11	InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models	Min Hou et.al.	2412.08480	link
2024-12-11	CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis	Mu Zhang et.al.	2412.08464	null
2024-12-11	Federated Learning for Traffic Flow Prediction with Synthetic Data Augmentation	Fermin Orozco et.al.	2412.08460	null
2024-12-10	Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets	Zhen Liu et.al.	2412.07775	null
2024-12-10	UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics	Xi Chen et.al.	2412.07774	null
2024-12-10	From Slow Bidirectional to Fast Causal Video Generators	Tianwei Yin et.al.	2412.07772	null
2024-12-10	Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds	Xiaoyu Xiang et.al.	2412.07766	null
2024-12-10	Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences	Alan Nawzad Amin et.al.	2412.07763	link
2024-12-10	Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation	Jingxi Chen et.al.	2412.07761	null
2024-12-10	SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Jianhong Bai et.al.	2412.07760	link
2024-12-10	PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation	Fatemeh Nazarieh et.al.	2412.07754	null
2024-12-10	Multi-Shot Character Consistency for Text-to-Video Generation	Yuval Atzmon et.al.	2412.07750	null
2024-12-10	StyleMaster: Stylize Your Video with Artistic Generation and Translation	Zixuan Ye et.al.	2412.07744	null
2024-12-10	STIV: Scalable Text and Image Conditioned Video Generation	Zongyu Lin et.al.	2412.07730	null
2024-12-10	ObjCtrl-2.5D: Training-free Object Control with Camera Poses	Zhouxia Wang et.al.	2412.07721	null
2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link
2024-12-10	Privacy-Preserving Customer Support: A Framework for Secure and Scalable Interactions	Anant Prakash Awasthi et.al.	2412.07687	null
2024-12-10	Optimizing Sensor Redundancy in Sequential Decision-Making Problems	Jonas Nüßlein et.al.	2412.07686	null
2024-12-10	[MASK] is All You Need	Vincent Tao Hu et.al.	2412.06787	link
2024-12-09	Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation	Ruihan Gao et.al.	2412.06785	link
2024-12-09	Diverse Score Distillation	Yanbo Xu et.al.	2412.06780	null
2024-12-09	Visual Lexicon: Rich Image Features in Language Space	XuDong Wang et.al.	2412.06774	null
2024-12-09	InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention	Howard Zhang et.al.	2412.06753	null
2024-12-09	ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities	Adhiraj Ghosh et.al.	2412.06745	null
2024-12-10	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	null
2024-12-09	Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection	Caiyun Xie et.al.	2412.06727	link
2024-12-09	You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale	Baorui Ma et.al.	2412.06699	link
2024-12-09	Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy	Yuxuan Xue et.al.	2412.06698	null
2024-12-09	Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset	Shanshan Wang et.al.	2412.06666	null
2024-12-09	Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion	Shuaiting Li et.al.	2412.06661	null
2024-12-09	MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences	Weitao Wang et.al.	2412.06614	null
2024-12-09	Augmented reality for upper limb rehabilitation: real-time kinematic feedback with HoloLens 2	Beatrice Luciani et.al.	2412.06596	null
2024-12-09	EmoSpeech: A Corpus of Emotionally Rich and Contextually Detailed Speech Annotations	Weizhen Bian et.al.	2412.06581	null
2024-12-06	Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model	Lening Wang et.al.	2412.05280	link
2024-12-06	Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories	Susung Hong et.al.	2412.05279	null
2024-12-06	Birth and Death of a Rose	Chen Geng et.al.	2412.05278	null
2024-12-06	MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models	Tuna Han Salih Meral et.al.	2412.05275	null
2024-12-06	Go-or-Grow Models in Biology: a Monster on a Leash	R. Thiessen et.al.	2412.05191	null
2024-12-06	Privacy Drift: Evolving Privacy Concerns in Incremental Learning	Sayyed Farid Ahamed et.al.	2412.05183	null
2024-12-06	DNF: Unconditional 4D Generation with Dictionary-based Neural Fields	Xinyi Zhang et.al.	2412.05161	null
2024-12-06	A text-to-tabular approach to generate synthetic patient data using LLMs	Margaux Tornqvist et.al.	2412.05153	link
2024-12-06	LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation	Donald Shenaj et.al.	2412.05148	link
2024-12-06	How to Squeeze An Explanation Out of Your Model	Tiago Roxo et.al.	2412.05134	null
2024-12-06	Probabilistic Galaxy Field Generation with Diffusion Models	Tanner Sether et.al.	2412.05131	null
2024-12-06	The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation	Ruoyu Wang et.al.	2412.05101	null
2024-12-06	Reconstructing Quantitative Cerebral Perfusion Images Directly From Measured Sinogram Data Acquired Using C-arm Cone-Beam CT	Haotian Zhao et.al.	2412.05084	null
2024-12-06	ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration	Chi-Wei Hsiao et.al.	2412.05043	null
2024-12-06	Get It Right: Improving Comprehensibility with Adaptable Speech Expression of a Humanoid Service Robot	Thomas Sievers et.al.	2412.05022	null
2024-12-05	PaintScene4D: Consistent 4D Scene Generation from Text Prompts	Vinayak Gupta et.al.	2412.04471	null
2024-12-05	LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors	Yusuf Dalva et.al.	2412.04460	null
2024-12-05	Four-Plane Factorized Video Autoencoders	Mohammed Suhail et.al.	2412.04452	null
2024-12-05	MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation	Longtao Zheng et.al.	2412.04448	null
2024-12-05	DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models	Yizhuo Li et.al.	2412.04446	null
2024-12-05	Learning Artistic Signatures: Symmetry Discovery and Style Transfer	Emma Finn et.al.	2412.04441	null
2024-12-05	GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration	Kaiyi Huang et.al.	2412.04440	null
2024-12-05	Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Yuying Ge et.al.	2412.04432	link
2024-12-05	Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis	Jian Han et.al.	2412.04431	link
2024-12-05	Reversible molecular simulation for training classical and machine learning force fields	Joe G Greener et.al.	2412.04374	link
2024-12-05	Machine Theory of Mind for Autonomous Cyber-Defence	Luke Swaby et.al.	2412.04367	null
2024-12-05	ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation	Dayoung Gong et.al.	2412.04353	null
2024-12-05	RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse	Zhouyingcheng Liao et.al.	2412.04343	null
2024-12-05	Likelihood-Scheduled Score-Based Generative Modeling for Fully 3D PET Image Reconstruction	George Webber et.al.	2412.04339	null
2024-12-05	Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction	George Webber et.al.	2412.04324	null
2024-12-04	Navigation World Models	Amir Bar et.al.	2412.03572	null
2024-12-04	MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation	Zehuan Huang et.al.	2412.03558	null
2024-12-04	NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model	Xinheng Xie et.al.	2412.03539	null
2024-12-04	NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images	Lingen Li et.al.	2412.03517	null
2024-12-04	Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion	Shengyuan Zhang et.al.	2412.03515	link
2024-12-04	Data Fusion of Semantic and Depth Information in the Context of Object Detection	Md Abu Yusuf et.al.	2412.03490	null
2024-12-04	Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective	Neta Shaul et.al.	2412.03487	null
2024-12-04	Pre-trained Multiple Latent Variable Generative Models are good defenders against Adversarial Attacks	Dario Serez et.al.	2412.03453	link
2024-12-04	CleanDIFT: Diffusion Features without Noise	Nick Stracke et.al.	2412.03439	link
2024-12-04	SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model	Yan Li et.al.	2412.03430	null
2024-12-04	Skel3D: Skeleton Guided Novel View Synthesis	Aron Fóthi et.al.	2412.03407	null
2024-12-04	Identifiability implies consistency of MLE in partially observed diffusions on a torus	Ibrahim Ekren et.al.	2412.03380	null
2024-12-04	TASR: Timestep-Aware Diffusion Model for Image Super-Resolution	Qinwei Lin et.al.	2412.03355	link
2024-12-04	DIVE: Taming DINO for Subject-Driven Video Editing	Yi Huang et.al.	2412.03347	null
2024-12-04	Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis	Tao Jun Lin et.al.	2412.03315	null
2024-12-03	Motion Prompting: Controlling Video Generation with Motion Trajectories	Daniel Geng et.al.	2412.02700	null
2024-12-03	Diffusion-based Visual Anagram as Multi-task Learning	Zhiyuan Xu et.al.	2412.02693	link
2024-12-03	FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation	Kefan Chen et.al.	2412.02690	null
2024-12-04	SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance	Viet Nguyen et.al.	2412.02687	null
2024-12-03	AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction	Lingteng Qiu et.al.	2412.02684	null
2024-12-03	Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation	Yiftach Edelstein et.al.	2412.02631	null
2024-12-03	The effect of priors on Learning with Restricted Boltzmann Machines	Gianluca Manzan et.al.	2412.02623	null
2024-12-03	ComPair-2: A Next Generation Medium Energy Gamma-ray Telescope Prototype	Regina Caputo et.al.	2412.02562	null
2024-12-03	The Two-Center Problem of Uncertain Points on Cactus Graphs	Haitao Xu et.al.	2412.02559	null
2024-12-03	ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer	Jin Hu et.al.	2412.02545	link
2024-12-03	Unveiling Concept Attribution in Diffusion Models	Quang H. Nguyen et.al.	2412.02542	link
2024-12-03	LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data	Hanyu Zhang et.al.	2412.02525	null
2024-12-03	GerPS-Compare: Comparing NER methods for legal norm analysis	Sarah T. Bachinger et.al.	2412.02427	null
2024-12-03	It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model	Mingyi Shi et.al.	2412.02419	null
2024-12-03	A Multi-Agent Framework for Extensible Structured Text Generation in PLCs	Donghao Yang et.al.	2412.02410	null
2024-11-29	Nanostructured micrometric-pore membranes for nanofiltration: Micrometric geometry may optimize performance, energy efficiency and operational lifetime	J. C. Verde et.al.	2411.19900	null
2024-11-29	Input-Output Optics as a Causal Time Series Mapping: A Generative Machine Learning Solution	Abhijit Sen et.al.	2411.19897	null
2024-11-29	MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks	Yiming Wu et.al.	2411.19786	null
2024-11-29	Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy	Jeheon Woo et.al.	2411.19769	null
2024-11-29	JetFormer: An Autoregressive Generative Model of Raw Images and Text	Michael Tschannen et.al.	2411.19722	link
2024-11-29	Inverse Design of Mechanical Metamaterials Using a Point-Cloud-Based Deep Generative Model	Seungwook Hong et.al.	2411.19681	null
2024-11-29	TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting	Bojun Xiong et.al.	2411.19654	link
2024-11-29	Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing	Wenyi Mo et.al.	2411.19652	link
2024-11-29	Enhancing Security in Third-Party Library Reuse -- Comprehensive Detection of 1-day Vulnerability through Code Patch Analysis	Shangzhi Xu et.al.	2411.19648	null
2024-11-29	Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings	Qiong Wu et.al.	2411.19628	link
2024-11-29	Unimib Assistant: designing a student-friendly RAG-based chatbot for all their needs	Chiara Antico et.al.	2411.19554	null
2024-11-29	Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook	Florinel-Alin Croitoru et.al.	2411.19537	link
2024-11-29	Quantized Delta Weight Is Safety Keeper	Yule Liu et.al.	2411.19530	null
2024-12-02	DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding	Jungbin Cho et.al.	2411.19527	null
2024-11-29	Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis	Tianqi Li et.al.	2411.19509	link
2024-11-27	Textured Gaussians for Enhanced 3D Scene Appearance Modeling	Brian Chao et.al.	2411.18625	null
2024-11-27	GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data	Wentao Wang et.al.	2411.18624	null
2024-11-27	Diffusion Self-Distillation for Zero-Shot Customized Image Generation	Shengqu Cai et.al.	2411.18616	null
2024-11-27	CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models	Rundi Wu et.al.	2411.18613	null
2024-11-27	Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis	Eva Prakash et.al.	2411.18602	null
2024-11-27	Bit symmetry entails the symmetry of the quantum transition probability	Gerd Niestegge et.al.	2411.18589	null
2024-11-27	Building Confidence in Deep Generative Protein Design	Tianyuan Zheng et.al.	2411.18568	link
2024-11-27	High-throughput antibody screening with high-quality factor nanophotonics and bioprinting	Sajjad Abdollahramezani et.al.	2411.18557	null
2024-11-27	FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion	Haosen Yang et.al.	2411.18552	null
2024-11-28	Enhancing weed detection performance by means of GenAI-based image augmentation	Sourav Modak et.al.	2411.18513	null
2024-11-27	GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation	Pengfei Zhou et.al.	2411.18499	null
2024-11-27	Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification	José Fernando Núñez et.al.	2411.18456	null
2024-11-27	Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator	Frederic Kirstein et.al.	2411.18444	null
2024-11-27	Learning the Evolution of Physical Structure of Galaxies via Diffusion Models	Andrew Lizarraga et.al.	2411.18440	link
2024-11-27	Search for heavy scalar or pseudoscalar states in $\mathrm{t \bar{t}}$ events at CMS	Laurids Jeppe et.al.	2411.18414	null
2024-11-27	StableAnimator: High-Quality Identity-Preserving Human Image Animation	Shuyuan Tu et.al.	2411.17697	link
2024-11-26	ScribbleLight: Single Image Indoor Relighting with Scribbles	Jun Myeong Choi et.al.	2411.17696	null
2024-11-26	Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis	Akshita Gupta et.al.	2411.17690	null
2024-11-26	GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2411.17687	null
2024-11-26	Semi-analytical model for the calculation of solar radiation pressure and its effects on a LEO satellite with predicting the change in position vectors using machine learning techniques	Pranava Seth et.al.	2411.17626	null
2024-11-26	Accelerating Vision Diffusion Transformers with Skip Branches	Guanjie Chen et.al.	2411.17616	link
2024-11-26	Mixed-State Quantum Denoising Diffusion Probabilistic Model	Gino Kwun et.al.	2411.17608	null
2024-11-26	Making History Readable	Bipasha Banerjee et.al.	2411.17600	null
2024-11-26	VideoDirector: Precise Video Editing via Text-to-Video Models	Yukun Wang et.al.	2411.17592	null
2024-11-26	Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2411.17543	null
2024-11-26	Metaverse Innovation Canvas: A Tool for Extended Reality Product/Service Development	Amir Reza Asadi et.al.	2411.17541	null
2024-11-26	IMPROVE: Improving Medical Plausibility without Reliance on Human Validation -- An Enhanced Prototype-Guided Diffusion Framework	Anurag Shandilya et.al.	2411.17535	null
2024-11-26	FTMoMamba: Motion Generation with Frequency and Text State Space Models	Chengjian Li et.al.	2411.17532	null
2024-11-26	Exact and Heuristic Approaches for the Covering Tour Location Routing Problem	Andreas Hagn et.al.	2411.17510	link
2024-11-26	WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model	Zongjian Li et.al.	2411.17459	link
2024-11-25	Generative Omnimatte: Learning to Decompose Video into Layers	Yao-Chih Lee et.al.	2411.16683	null
2024-11-25	Diffusion Features for Zero-Shot 6DoF Object Pose Estimation	Bernd Von Gimborn et.al.	2411.16668	null
2024-11-25	DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation	Zun Wang et.al.	2411.16657	null
2024-11-25	Exploring Discrete Flow Matching for 3D De Novo Molecule Generation	Ian Dunn et.al.	2411.16644	link
2024-11-25	LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction	Yiran Sun et.al.	2411.16629	link
2024-11-25	Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models	Ronghuan Wu et.al.	2411.16602	null
2024-11-25	Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification	Andre Kassis et.al.	2411.16598	link
2024-11-25	Rethinking Diffusion for Text-Driven Human Motion Generation	Zichong Meng et.al.	2411.16575	null
2024-11-25	Representation Collapsing Problems in Vector Quantization	Wenhao Zhao et.al.	2411.16550	null
2024-11-25	ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction	Yuyang Hu et.al.	2411.16535	null
2024-11-25	PriorPath: Coarse-To-Fine Approach for Controlled De-Novo Pathology Semantic Masks Generation	Nati Daniel et.al.	2411.16515	null
2024-11-25	Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis	Boming Miao et.al.	2411.16503	null
2024-11-25	Multi-Resolution Generative Modeling of Human Motion from Limited Data	David Eduardo Moreno-Villamarín et.al.	2411.16498	null
2024-11-25	Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Xiaocong Yang et.al.	2411.16454	null
2024-11-25	Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data	A. Potnis et.al.	2411.16447	null
2024-11-22	DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving	Bencheng Liao et.al.	2411.15139	link
2024-11-22	Material Anything: Generating Materials for Any 3D Object via Diffusion	Xin Huang et.al.	2411.15138	null
2024-11-22	VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement	Daeun Lee et.al.	2411.15115	null
2024-11-22	RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts	Hjalmar Wijk et.al.	2411.15114	link
2024-11-22	Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion	Samarth N Ramesh et.al.	2411.15113	null
2024-11-22	Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation	Lakshmikar R. Polamreddy et.al.	2411.15084	link
2024-11-22	Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network	Irfan Nafiz Shahan et.al.	2411.15082	link
2024-11-22	Empowering Clients: Transformation of Design Processes Due to Generative AI	Johannes Schneider et.al.	2411.15061	null
2024-11-22	The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel	David John Needham et.al.	2411.15054	null
2024-11-22	FloAt: Flow Warping of Self-Attention for Clothing Animation Generation	Swasti Shreya Mishra et.al.	2411.15028	null
2024-11-22	Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation	Huy Le et.al.	2411.14913	null
2024-11-22	Dynamically Encircled Higher-order Exceptional Points in an Optical Fiber	Arpan Roy et.al.	2411.14874	null
2024-11-22	Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation	Dingyuan Shi et.al.	2411.14871	null
2024-11-22	Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation	Jeongsol Kim et.al.	2411.14863	null
2024-11-22	Style-Friendly SNR Sampler for Style-Driven Generation	Jooyoung Choi et.al.	2411.14793	null
2024-11-21	Stable Flow: Vital Layers for Training-Free Image Editing	Omri Avrahami et.al.	2411.14430	link
2024-11-21	Transformer-based Heuristic for Advanced Air Mobility Planning	Jun Xiang et.al.	2411.14427	null
2024-11-21	A Python-Based Approach to Sputter Deposition Simulations in Combinatorial Materials Science	Felix Thelen et.al.	2411.14413	null
2024-11-21	Multi-Agent Environments for Vehicle Routing Problems	Ricardo Gama et.al.	2411.14411	link
2024-11-21	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation	Yuanhao Cai et.al.	2411.14384	null
2024-11-21	CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields	Xin-Yang Liu et.al.	2411.14378	null
2024-11-21	Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models	Houze Liu et.al.	2411.14353	null
2024-11-21	DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding	Tianhe Ren et.al.	2411.14347	link
2024-11-21	Lower Dimensional Spherical Representation of Medium Voltage Load Profiles for Visualization, Outlier Detection, and Generative Modelling	Edgar Mauricio Salazar Duque et.al.	2411.14346	null
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	link
2024-11-21	Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models	Iacopo Ghinassi et.al.	2411.14272	link
2024-11-21	Guided MRI Reconstruction via Schrödinger Bridge	Yue Wang et.al.	2411.14269	null
2024-11-21	Regional Attention for Shadow Removal	Hengxing Liu et.al.	2411.14201	link
2024-11-21	TaQ-DiT: Time-aware Quantization for Diffusion Transformers	Xinyan Liu et.al.	2411.14172	null
2024-11-21	Creating a Formally Verified Neural Network for Autonomous Navigation: An Experience Report	Syed Ali Asadullah Bukhari et.al.	2411.14163	link
2024-11-20	REDUCIO! Generating 1024 $\times$ 1024 Video within 16 Seconds using Extremely Compressed Motion Latents	Rui Tian et.al.	2411.13552	link
2024-11-20	Identity Preserving 3D Head Stylization with Multiview Score Distillation	Bahri Batuhan Bilecen et.al.	2411.13536	null
2024-11-20	VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models	Ziqi Huang et.al.	2411.13503	link
2024-11-20	LIMBA: An Open-Source Framework for the Preservation and Valorization of Low-Resource Languages using Generative Models	Salvatore Mario Carta et.al.	2411.13453	null
2024-11-20	Heuristically Adaptive Diffusion-Model Evolutionary Strategy	Benedikt Hartl et.al.	2411.13420	null
2024-11-20	Energy-based generative models for monoclonal antibodies	Paul Pereira et.al.	2411.13390	link
2024-11-20	Small and Close-In Planets are Uncommon around A-type Stars	Steven Giacalone et.al.	2411.13363	null
2024-11-20	Vertical Validation: Evaluating Implicit Generative Models for Graphs on Thin Support Regions	Mai Elkady et.al.	2411.13358	null
2024-11-20	A CSI Feedback Framework based on Transmitting the Important Values and Generating the Others	Zhilin Du et.al.	2411.13298	null
2024-11-21	Structure-Based Molecule Optimization via Gradient-Guided Bayesian Update	Keyue Qiu et.al.	2411.13280	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	BIPro: Zero-shot Chinese Poem Generation via Block Inverse Prompting Constrained Generation Framework	Xu Zou et.al.	2411.13237	null
2024-11-20	Building music with Lego bricks and Raspberry Pi	Ana M. Barbancho et.al.	2411.13224	null
2024-11-20	A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM)	Antonino Visalli et.al.	2411.13203	link
2024-11-20	OpenMS WebApps: Building User-Friendly Solutions for MS Analysis	Tom David Müller et.al.	2411.13189	link
2024-11-19	Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs	Ahmed Akib Jawad Karim et.al.	2411.12712	null
2024-11-19	OrigamiPlot: An R Package and Shiny Web App Enhanced Visualizations for Multivariate Data	Yiwen Lu et.al.	2411.12674	null
2024-11-19	Auto-Evaluation with Few Labels through Post-hoc Regression	Benjamin Eyre et.al.	2411.12665	null
2024-11-19	PoM: Efficient Image and Video Generation with the Polynomial Mixer	David Picard et.al.	2411.12663	link
2024-11-19	Optimizing Airline Reservation Systems with Edge-Enabled Microservices: A Framework for Real-Time Data Processing and Enhanced User Responsiveness	Biman Barua et.al.	2411.12650	null
2024-11-19	DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models	Vinay Kumar Sankarapu et.al.	2411.12643	link
2024-11-19	Improving Controllability and Editability for Pretrained Text-to-Music Generation Models	Yixiao Zhang et.al.	2411.12641	null
2024-11-19	Universal programmable waveguide arrays	Akram Youssry et.al.	2411.12610	null
2024-11-19	Whisper Finetuning on Nepali Language	Sanjay Rijal et.al.	2411.12587	null
2024-11-19	Predicting Customer Satisfaction by Replicating the Survey Response Distribution	Etienne Manderscheid et.al.	2411.12539	null
2024-11-19	Data Pruning in Generative Diffusion Models	Rania Briq et.al.	2411.12523	link
2024-11-19	Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing	Ruyi Ding et.al.	2411.12508	null
2024-11-19	Empirical Privacy Evaluations of Generative and Predictive Machine Learning Models -- A review and challenges for practice	Flavio Hafner et.al.	2411.12451	null
2024-11-19	Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models	Jun Xiao et.al.	2411.12450	null
2024-11-19	A general modeling and simulation framework for dynamic vehicle routing	Markó Horváth et.al.	2411.12406	link
2024-11-18	QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou	Xinchen Luo et.al.	2411.11739	null
2024-11-18	Aligning Few-Step Diffusion Models with Dense Reward Difference Learning	Ziyi Zhang et.al.	2411.11727	link
2024-11-18	Multiscale nonlinear integration drives accurate encoding of input information	Giorgio Nicoletti et.al.	2411.11710	null
2024-11-18	Robust Reinforcement Learning under Diffusion Models for Data with Jumps	Chenyang Jiang et.al.	2411.11697	null
2024-11-18	Active droplets controlled by enzymatic reactions	Jacques Fries et.al.	2411.11696	null
2024-11-18	Do Captioning Metrics Reflect Music Semantic Alignment?	Jinwoo Lee et.al.	2411.11692	null
2024-11-18	Conceptwm: A Diffusion Model Watermark for Concept Protection	Liangqi Lei et.al.	2411.11688	null
2024-11-19	GNN-Based Code Annotation Logic for Establishing Security Boundaries in C Code	Varun Gadey et.al.	2411.11567	null
2024-11-19	Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation	Rüveyda Yilmaz et.al.	2411.11515	link
2024-11-18	Collaborative Contrastive Network for Click-Through Rate Prediction	Chen Gao et.al.	2411.11508	null
2024-11-18	LaVin-DiT: Large Vision Diffusion Transformer	Zhaoqing Wang et.al.	2411.11505	null
2024-11-18	Alien Recombination: Exploring Concept Blends Beyond Human Cognitive Availability in Visual Art	Alejandro Hernandez et.al.	2411.11494	null
2024-11-18	MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion	Dongseok Shim et.al.	2411.11475	null
2024-11-18	GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts	Junwen He et.al.	2411.11435	null
2024-11-18	CLUE-MARK: Watermarking Diffusion Models using CLWE	Kareem Shehata et.al.	2411.11434	null
2024-11-15	M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation	Sucheng Ren et.al.	2411.10433	link
2024-11-15	Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems	Feiqin Zhu et.al.	2411.10431	null
2024-11-15	Multiscale Dubuc: A New Similarity Measure for Time Series	Mahsa Khazaei et.al.	2411.10418	link
2024-11-15	Experimental generation of extreme electron beams for advanced accelerator applications	Claudio Emma et.al.	2411.10413	null
2024-11-15	How to Build a Quantum Supercomputer: Scaling Challenges and Opportunities	Masoud Mohseni et.al.	2411.10406	null
2024-11-15	Nonlinearity-Driven Morphing and Control of Topological Modes in Non-Hermitian Systems	Zhao-Fan Cai et.al.	2411.10398	null
2024-11-15	Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion	Haoran Wei et.al.	2411.10369	null
2024-11-15	Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding	Huming Qiu et.al.	2411.10329	null
2024-11-15	Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence	Guodong Sun et.al.	2411.10321	null
2024-11-15	Assortment Optimization under the Multinomial Logit Model with Covering Constraints	Omar El Housni et.al.	2411.10310	null
2024-11-15	Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting	Ziqi Xie et.al.	2411.10309	link
2024-11-15	MDHP-Net: Detecting Injection Attacks on In-vehicle Network using Multi-Dimensional Hawkes Process and Temporal Model	Qi Liu et.al.	2411.10258	null
2024-11-15	The Unreasonable Effectiveness of Guidance for Diffusion Models	Tim Kaiser et.al.	2411.10257	null
2024-11-15	Smooth transport map via diffusion process	Arthur Stéphanovitch et.al.	2411.10235	null
2024-11-15	ColorEdit: Training-free Image-Guided Color editing with diffusion model	Xingxi Yin et.al.	2411.10232	null
2024-11-14	A Bayesian Optimization Approach to Machine Translation Reranking	Julius Cheng et.al.	2411.09694	link
2024-11-14	SimTube: Generating Simulated Video Comments through Multimodal AI and User Personas	Yu-Kai Hung et.al.	2411.09577	null
2024-11-14	Golden Noise for Diffusion Models: A Learning Framework	Zikai Zhou et.al.	2411.09502	link
2024-11-14	Sparse Bayesian Generative Modeling for Compressive Sensing	Benedikt Böck et.al.	2411.09483	link
2024-11-14	DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing	Junjie Zhou et.al.	2411.09451	null
2024-11-14	Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models	Chutian Meng et.al.	2411.09449	null
2024-11-14	A survey of probabilistic generative frameworks for molecular simulations	Richard John et.al.	2411.09388	link
2024-11-14	Multi-scale Generative Modeling for Fast Sampling	Xiongye Xiao et.al.	2411.09356	null
2024-11-14	ParaLBench: A Large-Scale Benchmark for Computational Paralinguistics over Acoustic Foundation Models	Zixing Zhang et.al.	2411.09349	null
2024-11-15	Approximate Probabilistic Inference for Time-Series Data: A Robust Latent Gaussian Model With Temporal Awareness	Anton Johansson et.al.	2411.09312	null
2024-11-14	EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models	Soowon Kim et.al.	2411.09302	null
2024-11-14	LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space	Guanwen Feng et.al.	2411.09268	null
2024-11-14	Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey	Xuannan Liu et.al.	2411.09259	link
2024-11-14	RibCageImp: A Deep Learning Framework for 3D Ribcage Implant Generation	Gyanendra Chaubey et.al.	2411.09204	null
2024-11-14	Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM	Xiaoran Yang et.al.	2411.09189	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources	Yasin Abdulkadir et.al.	2411.08876	null
2024-11-13	Offline Adaptation of Quadruped Locomotion using Diffusion Models	Reece O'Mahoney et.al.	2411.08832	null
2024-11-13	SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate	Yifei Jin et.al.	2411.08767	null
2024-11-13	Analyst Reports and Stock Performance: Evidence from the Chinese Market	Rui Liu et.al.	2411.08726	null
2024-11-14	Reducing ADC Front-end Costs During Training of On-sensor Printed Multilayer Perceptrons	Florentia Afentaki et.al.	2411.08674	link
2024-11-13	Joint Model Caching and Resource Allocation in Generative AI-Enabled Wireless Edge Networks	Zhang Liu et.al.	2411.08672	null
2024-11-13	Toward Human Understanding with Controllable Synthesis	Hanz Cuevas-Velasquez et.al.	2411.08663	null
2024-11-13	The Galactica database: an open, generic and versatile tool for the dissemination of simulation data in astrophysics	Damien Chapon et.al.	2411.08647	null
2024-11-13	Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models	Chengdong Dong et.al.	2411.08642	null
2024-11-13	Deep Generative Demand Learning for Newsvendor and Pricing	Shijin Gong et.al.	2411.08631	null
2024-11-13	LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation	Pengwei Yin et.al.	2411.08606	null
2024-11-13	CorrSynth -- A Correlated Sampling Method for Diverse Dataset Generation from LLMs	Suhas S Kowshik et.al.	2411.08553	null
2024-11-13	Explainers' Mental Representations of Explainees' Needs in Everyday Explanations	Michael Erol Schaffer et.al.	2411.08514	null
2024-11-13	HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere	Hatef Otroshi Shahreza et.al.	2411.08470	null
2024-11-12	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-12	GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation	Yushi Lan et.al.	2411.08033	null
2024-11-12	Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings	Aditya Sanghi et.al.	2411.08017	link
2024-11-12	JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation	Yiyang Ma et.al.	2411.07975	link
2024-11-12	Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules	Binxu Wang et.al.	2411.07873	null
2024-11-12	Trustful LLMs: Customizing and Grounding Text Generation with Knowledge Bases and Dual Decoders	Xiaofeng Zhu et.al.	2411.07870	null
2024-11-12	CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory	Zhenkai Wu et.al.	2411.07863	link
2024-11-12	Sparsity-Aware Optimization of In-Memory Bayesian Binary Neural Network Accelerators	Prabodh Katti et.al.	2411.07842	null
2024-11-12	Novel View Synthesis with Pixel-Space Diffusion Models	Noam Elata et.al.	2411.07765	null
2024-11-12	Nanosecond nanothermometry in an electron microscope	Florian Castioni et.al.	2411.07764	null
2024-11-12	LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution	Aditya Kasliwal et.al.	2411.07750	null
2024-11-12	The relationship between general equilibrium models with infinite-lived agents and overlapping generations models, and some applications	Ngoc-Sang Pham et.al.	2411.07674	null
2024-11-12	Evaluating the Generation of Spatial Relations in Text and Image Generative Models	Shang Hong Sim et.al.	2411.07664	null
2024-11-12	Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion	Kaiyu Song et.al.	2411.07627	null
2024-11-12	Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation	Kaiyu Song et.al.	2411.07625	null
2024-11-11	Score-based generative diffusion with "active" correlated noise sources	Alexandra Lamtyugina et.al.	2411.07233	null
2024-11-12	Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models	Yoad Tewel et.al.	2411.07232	null
2024-11-11	Learning from Limited and Imperfect Data	Harsh Rangwani et.al.	2411.07229	null
2024-11-11	TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language Models	Matheus Simão et.al.	2411.07224	null
2024-11-11	DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID	Nyle Siddiqui et.al.	2411.07205	link
2024-11-11	Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter	Domitille Gérard et.al.	2411.07202	null
2024-11-11	OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision	Cong Wei et.al.	2411.07199	null
2024-11-11	More Expressive Attention with Negative Weights	Ang Lv et.al.	2411.07176	link
2024-11-11	Edify 3D: Scalable High-Quality 3D Asset Generation	NVIDIA et.al.	2411.07135	null
2024-11-11	Benchmarking LLMs' Judgments with No Gold Standard	Shengwei Xu et.al.	2411.07127	link
2024-11-11	Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models	NVIDIA et.al.	2411.07126	null
2024-11-11	Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models	Yanchen Wang et.al.	2411.07121	link
2024-11-11	Scaling Mesh Generation via Compressive Tokenization	Haohan Weng et.al.	2411.07025	link
2024-11-11	An Electrocardiogram Monitoring Device Based on STM32	Wenqi Guan et.al.	2411.06962	null
2024-11-11	Generative Feature Training of Thin 2-Layer Networks	Johannes Hertrich et.al.	2411.06848	link
2024-11-08	StdGEN: Semantic-Decomposed 3D Character Generation from Single Images	Yuze He et.al.	2411.05738	null
2024-11-08	Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models	Jia-Hong Huang et.al.	2411.05706	null
2024-11-08	Improving Molecular Graph Generation with Flow Matching and Optimal Transport	Xiaoyang Hou et.al.	2411.05676	null
2024-11-08	Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion	Nan Song et.al.	2411.05544	null
2024-11-08	Improving image synthesis with diffusion-negative sampling	Alakh Desai et.al.	2411.05473	null
2024-11-08	Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation	Peidong Liu et.al.	2411.05472	link
2024-11-08	IntellBot: Retrieval Augmented LLM Chatbot for Cyber Threat Knowledge Delivery	Dincy R. Arikkat et.al.	2411.05442	link
2024-11-08	RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction	Xingyu Ai et.al.	2411.05354	link
2024-11-08	Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons	Rahul Gulati et.al.	2411.05329	null
2024-11-08	Social balance in directed networks	Bingjie Hao et.al.	2411.05327	null
2024-11-08	SeqRFM: Fast RFM Analysis in Sequence Data	Yanxin Zheng et.al.	2411.05317	link
2024-11-08	Differentiable Calibration of Inexact Stochastic Simulation Models via Kernel Score Minimization	Ziwei Su et.al.	2411.05315	null
2024-11-08	A Real-time Face Mask Detection and Social Distancing System for COVID-19 using Attention-InceptionV3 Model	Abdullah Al Asif et.al.	2411.05312	null
2024-11-08	Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet	Boxiao Yu et.al.	2411.05302	null
2024-11-08	GPT Semantic Cache: Reducing LLM Costs and Latency via Semantic Embedding Caching	Sajal Regmi et.al.	2411.05276	null
2024-11-07	SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-07	ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing	Jun-Kun Chen et.al.	2411.05006	null
2024-11-07	Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models	Shuhong Zheng et.al.	2411.05005	null
2024-11-07	ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning	David Junhao Zhang et.al.	2411.05003	null
2024-11-07	SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation	Koichi Namekata et.al.	2411.04989	null
2024-11-07	Few-Shot Task Learning through Inverse Generative Modeling	Aviv Netanyahu et.al.	2411.04987	null
2024-11-07	How fast does the WallGo? A package for computing wall velocities in first-order phase transitions	Andreas Ekstedt et.al.	2411.04970	link
2024-11-07	VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes	Advaith V. Sethuraman et.al.	2411.04963	null
2024-11-07	Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification	Mischa Dombrowski et.al.	2411.04956	null
2024-11-07	Fed-LDR: Federated Local Data-infused Graph Creation with Node-centric Model Refinement	Jiechao Gao et.al.	2411.04936	null
2024-11-07	DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion	Wenqiang Sun et.al.	2411.04928	null
2024-11-07	StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration	Panwen Hu et.al.	2411.04925	null
2024-11-07	Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion	Kaizhe Hu et.al.	2411.04919	link
2024-11-07	GASE: Generatively Augmented Sentence Encoding	Manuel Frank et.al.	2411.04914	null
2024-11-07	Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation	Benito Buchheim et.al.	2411.04724	null
2024-11-06	Community Forensics: Using Thousands of Generators to Train Fake Image Detectors	Jeongsoo Park et.al.	2411.04125	null
2024-11-06	Stepping Forward on the Last Mile	Chen Feng et.al.	2411.04036	null
2024-11-06	Prototyping O-RAN Enabled UAV Experimentation for the AERPAW Testbed	Joshua Moore et.al.	2411.04027	null
2024-11-06	Object-Centric Dexterous Manipulation from Human Motion Data	Yuanpei Chen et.al.	2411.04005	null
2024-11-06	Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging	Yuan Bi et.al.	2411.04004	null
2024-11-06	ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy	Chenrui Tie et.al.	2411.03990	null
2024-11-06	ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models	Ashutosh Srivastava et.al.	2411.03982	null
2024-11-06	Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning	Jiawei Yao et.al.	2411.03978	link
2024-11-06	Bayesian algorithmic perfumery: A Hierarchical Relevance Vector Machine for the Estimation of Personalized Fragrance Preferences based on Three Sensory Layers and Jungian Personality Archetypes	Rolando Gonzales Martinez et.al.	2411.03965	null
2024-11-06	Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks	Felipe Marra et.al.	2411.03948	link
2024-11-06	Can Custom Models Learn In-Context? An Exploration of Hybrid Architecture Performance on In-Context Learning Tasks	Ryan Campbell et.al.	2411.03945	link
2024-11-06	GUIDE-VAE: Advancing Data Generation with User Information and Pattern Dictionaries	Kutay Bölat et.al.	2411.03936	link
2024-11-06	Large Generative Model-assisted Talking-face Semantic Communication System	Feibo Jiang et.al.	2411.03876	null
2024-11-06	ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization	Huayang Huang et.al.	2411.03862	link
2024-11-06	Sub-DM: Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction	Yu Guan et.al.	2411.03758	link
2024-11-05	MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning	Ziliang Gan et.al.	2411.03314	null
2024-11-05	LLMs for Domain Generation Algorithm Detection	Reynier Leyva La O et.al.	2411.03307	null
2024-11-05	DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models	Ying Zhou et.al.	2411.03250	null
2024-11-05	On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models	Tariq Berrada Ifriqi et.al.	2411.03177	null
2024-11-05	Unleashing the power of novel conditional generative approaches for new materials discovery	Lev Novitskiy et.al.	2411.03156	link
2024-11-05	Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting	Adrian B. Chłopowiec et.al.	2411.03098	null
2024-11-05	Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising	Tao Huang et.al.	2411.03053	null
2024-11-05	GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details	Zhongjin Luo et.al.	2411.03047	null
2024-11-05	Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT	Pourya Jafarzadeh et.al.	2411.02964	null
2024-11-05	IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems	Heiko Oppel et.al.	2411.02954	null
2024-11-05	LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior	Xingjian Tang et.al.	2411.02951	null
2024-11-05	A scalable generative model for dynamical system reconstruction from neuroimaging data	Eric Volkmann et.al.	2411.02949	link
2024-11-05	Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey	Ao Fu et.al.	2411.02914	null
2024-11-05	The Unreasonable Effectiveness of LLMs for Query Optimization	Peter Akioyamen et.al.	2411.02862	link
2024-11-05	ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate	Shohei Taniguchi et.al.	2411.02853	link
2024-11-04	Training-free Regional Prompting for Diffusion Transformers	Anthony Chen et.al.	2411.02395	link
2024-11-04	How Far is Video Generation from World Model: A Physical Law Perspective	Bingyi Kang et.al.	2411.02385	null
2024-11-04	Virgo Filaments IV: Using WISE to Measure the Modification of Star-Forming Disks in the Extended Regions Around the Virgo Cluster	Kim Conger et.al.	2411.02352	null
2024-11-04	Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition	Xinkai Liu et.al.	2411.02334	null
2024-11-05	PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance	Ruyang Liu et.al.	2411.02327	link
2024-11-04	LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation	Mufei Li et.al.	2411.02322	link
2024-11-04	CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments	Kung-Hsiang Huang et.al.	2411.02305	link
2024-11-04	Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation	Xianghui Yang et.al.	2411.02293	null
2024-11-04	Counterfactual Explanations via Riemannian Latent Space Traversal	Paraskevas Pegios et.al.	2411.02259	null
2024-11-04	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-04	Recursive Learning of Asymptotic Variational Objectives	Alessandro Mastrototaro et.al.	2411.02217	null
2024-11-04	Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models	Anjith George et.al.	2411.02188	null
2024-11-04	Touch-to-Touch Translation -- Learning the Mapping Between Heterogeneous Tactile Sensing Technologies	Francesco Grella et.al.	2411.02187	null
2024-11-04	CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality	Yiqin Zhao et.al.	2411.02179	null
2024-11-04	CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education	Pranathi Rayavaram et.al.	2411.02143	null
2024-10-31	Bridging Geometric States via Geometric Diffusion Bridge	Shengjie Luo et.al.	2410.24220	null
2024-10-31	Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning	Penghui Ruan et.al.	2410.24219	link
2024-10-31	DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion	Weicai Ye et.al.	2410.24203	link
2024-10-31	Multi-Attribute Linguistic Tuning for Controlled Paraphrase Generation	Mohamed Elgaar et.al.	2410.24199	null
2024-10-31	Generative modelling for mass-mapping with fast uncertainty quantification	Jessica J. Whitney et.al.	2410.24197	link
2024-10-31	AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties	Xiayan Ji et.al.	2410.24178	link
2024-10-31	Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation	Fu Feng et.al.	2410.24160	null
2024-10-31	Scaling Concept With Text-Guided Diffusion Models	Chao Huang et.al.	2410.24151	null
2024-10-31	Repository-Level Compositional Code Translation and Validation	Ali Reza Ibrahimzada et.al.	2410.24117	link
2024-10-31	Extended electrochemical monitoring of biomolecular binding using commercially available, reusable electrodes in microliter volumes	Jeremy Mendez et.al.	2410.24110	null
2024-10-31	Sparsh: Self-supervised touch representations for vision-based tactile sensing	Carolina Higuera et.al.	2410.24090	null
2024-10-31	Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure	Xiang Li et.al.	2410.24060	link
2024-10-31	TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation	Sunjae Yoon et.al.	2410.24037	null
2024-10-31	Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities	Hatef Otroshi Shahreza et.al.	2410.24015	null
2024-10-31	DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination	Jia Fu et.al.	2410.24006	link
2024-10-30	ReferEverything: Towards Segmenting Everything We Can Speak of in Videos	Anurag Bagchi et.al.	2410.23287	null
2024-10-30	Provable acceleration for diffusion models under minimal assumptions	Gen Li et.al.	2410.23285	null
2024-10-30	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-30	SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation	Yining Hong et.al.	2410.23277	null
2024-10-30	Multi-student Diffusion Distillation for Better One-step Generators	Yanke Song et.al.	2410.23274	null
2024-10-30	ReaWristic: Remote Touch Sensation to Fingers from a Wristband via Visually Augmented Electro-Tactile Feedback	Yudai Tanaka et.al.	2410.23193	null
2024-10-30	Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning	Keqin Bao et.al.	2410.23136	link
2024-10-30	Educating for Hardware Specialization in the Chiplet Era: A Path for the HPC Community	Kazutomo Yoshii et.al.	2410.23127	null
2024-10-30	CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense	Mingkun Zhang et.al.	2410.23091	link
2024-10-30	General Bayesian quantile regression for counts via generative modeling	Yuta Yamauchi et.al.	2410.23081	null
2024-10-30	Controlling Language and Diffusion Models by Transporting Activations	Pau Rodriguez et.al.	2410.23054	link
2024-10-30	Dispersion kinks from electronic correlations in an unconventional iron-based superconductor	Ming-Hua Chang et.al.	2410.23044	null
2024-10-30	Improving Musical Accompaniment Co-creation via Diffusion Transformers	Javier Nistal et.al.	2410.23005	null
2024-10-30	DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes	Jialiang Zhang et.al.	2410.23004	null
2024-10-30	LumiSculpt: A Consistency Lighting Control Network for Video Generation	Yuxin Zhang et.al.	2410.22979	null
2024-10-29	CaStL: Constraints as Specifications through LLM Translation for Long-Horizon Task and Motion Planning	Weihang Guo et.al.	2410.22225	null
2024-10-29	A Gaussian Process Generative Model for QCD Equation of State	Jiaxuan Gong et.al.	2410.22160	null
2024-10-29	Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models	Raman Dutt et.al.	2410.22149	link
2024-10-29	AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts	Vishal Kumar et.al.	2410.22143	null
2024-10-29	Infrared photometry with InGaAs detectors: First light with SPECULOOS	Peter P. Pedersen et.al.	2410.22140	link
2024-10-29	SimRec: Mitigating the Cold-Start Problem in Sequential Recommendation by Integrating Item Similarity	Shaked Brody et.al.	2410.22136	link
2024-10-29	Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench	Zheyuan Liu et.al.	2410.22108	link
2024-10-29	Variational inference for pile-up removal at hadron colliders with diffusion models	Malte Algren et.al.	2410.22074	null
2024-10-29	PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement	Shutong Jin et.al.	2410.22059	null
2024-10-29	Dual Conditional Diffusion Models for Sequential Recommendation	Hongtao Huang et.al.	2410.21967	null
2024-10-29	PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference	Kendong Liu et.al.	2410.21966	null
2024-10-29	CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach	Dac Thai Nguyen et.al.	2410.21932	link
2024-10-29	Guided Diffusion-based Counterfactual Augmentation for Robust Session-based Recommendation	Muskan Gupta et.al.	2410.21892	null
2024-10-29	On the study of the limit cycles for a class of population models with time-varying factors	Renhao Tian et.al.	2410.21848	null
2024-10-29	Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model	Yiming Ji et.al.	2410.21842	null
2024-10-28	On Inductive Biases That Enable Generalization of Diffusion Transformers	Jie An et.al.	2410.21273	link
2024-10-28	EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Shih-Yang Liu et.al.	2410.21271	null
2024-10-28	LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior	Hanyu Wang et.al.	2410.21264	null
2024-10-28	One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation	Zhendong Wang et.al.	2410.21257	null
2024-10-28	On learning higher-order cumulants in diffusion models	Gert Aarts et.al.	2410.21212	null
2024-10-28	The VSPEC Collection: A suite of utilities to model spectroscopic phase curves of 3D exoplanet atmospheres in the presence of stellar variability	Ted M Johnson et.al.	2410.21190	null
2024-10-28	Trajectory Flow Matching with Applications to Clinical Time Series Modeling	Xi Zhang et.al.	2410.21154	link
2024-10-28	Synthetica: Large Scale Synthetic Data for Robot Perception	Ritvik Singh et.al.	2410.21153	null
2024-10-28	Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences	Zhihao Zhao et.al.	2410.21130	null
2024-10-28	Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models	Wenda Li et.al.	2410.21088	link
2024-10-28	Federated Time Series Generation on Feature and Temporally Misaligned Data	Chenrui Fan et.al.	2410.21072	null
2024-10-28	Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework	Vladimir Arkhipkin et.al.	2410.21061	link
2024-10-28	Beyond Autoregression: Fast LLMs via Self-Distillation Through Time	Justin Deschenaux et.al.	2410.21035	link
2024-10-29	EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior	Xin Xiang et.al.	2410.20981	null
2024-10-28	MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis	Di Qiu et.al.	2410.20974	null
2024-10-25	Model merging with SVD to tie the Knots	George Stoica et.al.	2410.19735	link
2024-10-25	Adversarial Environment Design via Regret-Guided Diffusion Models	Hojun Chung et.al.	2410.19715	null
2024-10-25	Perception, Control and Hardware for In-Hand Slip-Aware Object Manipulation with Parallel Grippers	Gabriel Arslan Waltersson et.al.	2410.19660	null
2024-10-25	DiffGS: Functional Gaussian Splatting Diffusion	Junsheng Zhou et.al.	2410.19657	null
2024-10-25	VARS: Vision-based Assessment of Risk in Security Systems	Pranav Gupta et.al.	2410.19642	null
2024-10-25	Diffusion models for lattice gauge field simulations	Qianteng Zhu et.al.	2410.19602	null
2024-10-25	Energy Efficient Dual Designs of FeFET-Based Analog In-Memory Computing with Inherent Shift-Add Capability	Zeyu Yang et.al.	2410.19593	null
2024-10-25	Hybrid Memetic Search for Electric Vehicle Routing with Time Windows, Simultaneous Pickup-Delivery, and Partial Recharges	Zubin Zheng et.al.	2410.19580	null
2024-10-25	Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series	Ilan Naiman et.al.	2410.19538	null
2024-10-25	Ensemble Data Assimilation for Particle-based Methods	Marius Duvillard et.al.	2410.19525	null
2024-10-25	Marked Temporal Bayesian Flow Point Processes	Hui Chen et.al.	2410.19512	null
2024-10-25	EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data	Xuetian Chen et.al.	2410.19461	null
2024-10-28	NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction	Zixuan Gong et.al.	2410.19452	link
2024-10-25	Learned Reference-based Diffusion Sampling for multi-modal distributions	Maxence Noble et.al.	2410.19449	null
2024-10-25	Generative Diffusion Models for Sequential Recommendations	Sharare Zolghadr et.al.	2410.19429	null
2024-10-24	Framer: Interactive Frame Interpolation	Wen Wang et.al.	2410.18978	null
2024-10-24	MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms	Ling-Hao Chen et.al.	2410.18977	null
2024-10-24	Unbounded: A Generative Infinite Game of Character Life Simulation	Jialu Li et.al.	2410.18975	null
2024-10-24	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	Hansheng Chen et.al.	2410.18974	link
2024-10-24	On the Crucial Role of Initialization for Matrix Factorization	Bingcong Li et.al.	2410.18965	null
2024-10-24	Stable Consistency Tuning: Understanding and Improving Consistency Models	Fu-Yun Wang et.al.	2410.18958	link
2024-10-24	Generation of synthetic financial time series by diffusion models	Tomonori Takahashi et.al.	2410.18897	null
2024-10-24	Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences	Weijian Luo et.al.	2410.18881	null
2024-10-24	The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods	Linda Laurier et.al.	2410.18866	null
2024-10-24	From Efficiency to Equity: Measuring Fairness in Preference Learning	Shreeyash Gowaikar et.al.	2410.18841	null
2024-10-24	From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages	Artur Kiulian et.al.	2410.18836	null
2024-10-24	Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation	Xiaoyu Zhang et.al.	2410.18830	null
2024-10-24	Towards Visual Text Design Transfer Across Languages	Yejin Choi et.al.	2410.18823	null
2024-10-24	Fast constrained sampling in pre-trained diffusion models	Alexandros Graikos et.al.	2410.18804	null
2024-10-24	Large Generative AI Models meet Open Networks for 6G: Integration, Platform, and Monetization	Peizheng Li et.al.	2410.18790	null
2024-10-23	DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes	Hengwei Bian et.al.	2410.18084	null
2024-10-23	Prioritized Generative Replay	Renhao Wang et.al.	2410.18082	null
2024-10-23	WorldSimBench: Towards Video Generation Models as World Simulators	Yiran Qin et.al.	2410.18072	null
2024-10-23	TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts	Yuxuan Xie et.al.	2410.18071	null
2024-10-23	Training Free Guided Flow Matching with Optimal Control	Luran Wang et.al.	2410.18070	null
2024-10-23	Spectrally shaped THz pulses from tapered dielectric waveguides	Karel Peetermans et.al.	2410.17975	null
2024-10-23	Optical Generative Models	Shiqi Chen et.al.	2410.17970	null
2024-10-23	A Wavelet Diffusion GAN for Image Super-Resolution	Lorenzo Aloisi et.al.	2410.17966	null
2024-10-23	Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation	Wenfang Yao et.al.	2410.17918	link
2024-10-23	regAL: Python Package for Active Learning of Regression Problems	Elizaveta Surzhikova et.al.	2410.17917	null
2024-10-23	Scaling Diffusion Language Models via Adaptation from Autoregressive Models	Shansan Gong et.al.	2410.17891	link
2024-10-23	Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech	Danilo de Oliveira et.al.	2410.17834	null
2024-10-23	PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation	Feiyan Feng et.al.	2410.17812	null
2024-10-23	GenUDC: High Quality 3D Mesh Generation with Unsigned Dual Contouring Representation	Ruowei Wang et.al.	2410.17802	link
2024-10-23	Regularized autoregressive modeling and its application to audio signal declipping	Ondřej Mokrý et.al.	2410.17790	link
2024-10-22	Large Language Models Empowered Personalized Web Agents	Hongru Cai et.al.	2410.17236	null
2024-10-22	Creativity in AI: Progresses and Challenges	Mete Ismayilzada et.al.	2410.17218	link
2024-10-22	Audio-to-Score Conversion Model Based on Whisper methodology	Hongyao Zhang et.al.	2410.17209	null
2024-10-22	Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding	Yasha Ektefaie et.al.	2410.17173	link
2024-10-22	Performance of the CMS high-level trigger during LHC Run 2	CMS Collaboration et.al.	2410.17038	null
2024-10-22	Hybrid Generative AI for De Novo Design of Co-Crystals with Enhanced Tabletability	Nina Gubina et.al.	2410.17005	link
2024-10-22	DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization	Haowei Zhu et.al.	2410.16942	null
2024-10-22	Hierarchical Clustering for Conditional Diffusion in Image Generation	Jorge da Silva Goncalves et.al.	2410.16910	link
2024-10-22	Bayes without Underfitting: Fully Correlated Deep Learning Posteriors via Alternating Projections	Marco Miani et.al.	2410.16901	null
2024-10-22	VistaDream: Sampling multiview consistent images for single-view scene reconstruction	Haiping Wang et.al.	2410.16892	null
2024-10-22	CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Nicholas I-Hsien Kuo et.al.	2410.16872	null
2024-10-22	MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model	Meng Xu et.al.	2410.16840	null
2024-10-22	Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?	Gustavo Penha et.al.	2410.16823	null
2024-10-22	Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection	Laurent Colbois et.al.	2410.16802	link
2024-10-22	One-Step Diffusion Distillation through Score Implicit Matching	Weijian Luo et.al.	2410.16794	link
2024-10-21	MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors	Honghua Chen et.al.	2410.16272	null
2024-10-21	Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos	Gengshan Yang et.al.	2410.16259	null
2024-10-21	Distribution Learning with Valid Outputs Beyond the Worst-Case	Nick Rittler et.al.	2410.16253	null
2024-10-21	Building A Coding Assistant via the Retrieval-Augmented Language Model	Xinze Li et.al.	2410.16229	link
2024-10-21	CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking	Nishat Raihan et.al.	2410.16211	null
2024-10-21	A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data	Simon Deltadahl et.al.	2410.16177	null
2024-10-22	Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models	Giannis Daras et.al.	2410.16152	null
2024-10-21	Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting	Robin Thériault et.al.	2410.16150	null
2024-10-21	SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation	Xinyi Zhou et.al.	2410.16119	null
2024-10-21	Critical Example Mining for Vehicle Trajectory Prediction using Flow-based Generative Models	Zhezhang Ding et.al.	2410.16083	null
2024-10-21	Continuous Speech Synthesis using per-token Latent Diffusion	Arnon Turetzky et.al.	2410.16048	null
2024-10-21	Some generalizations of the convective model of jet generation	S. N. Artekha et.al.	2410.16035	null
2024-10-21	ComPO: Community Preferences for Language Model Personalization	Sachin Kumar et.al.	2410.16027	null
2024-10-21	Massimo: Public Queue Monitoring and Management using Mass-Spring Model	Abhijeet Kumar et.al.	2410.16012	null
2024-10-21	AI-Driven Innovations in Modern Cloud Computing	Animesh Kumar et.al.	2410.15960	null
2024-10-18	BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities	Shaozhe Hao et.al.	2410.14672	link
2024-10-18	How Does Data Diversity Shape the Weight Landscape of Neural Networks?	Yang Ba et.al.	2410.14602	null
2024-10-18	Bayesian Multi-wavelength Imaging of the LMC SN1987A with SRG/eROSITA	Vincent Eberle et.al.	2410.14599	null
2024-10-18	Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets	Namid R. Stillman et.al.	2410.14587	null
2024-10-18	Reimagining partial thickness keratoplasty: An eye mountable robot for autonomous big bubble needle insertion	Y. Wang et.al.	2410.14577	null
2024-10-18	Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior	Calvin-Khang Ta et.al.	2410.14540	null
2024-10-18	Blockchain-Based Trust and Transparency in Airline Reservation Systems using Microservices Architecture	Biman Barua et.al.	2410.14518	null
2024-10-18	LEAD: Latent Realignment for Human Motion Diffusion	Nefeli Andreou et.al.	2410.14508	null
2024-10-18	Reinforcement Learning in Non-Markov Market-Making	Luca Lalor et.al.	2410.14504	null
2024-10-18	Data-driven topology design with persistent homology for enhancing population diversity	Taisei Kii et.al.	2410.14496	null
2024-10-18	ANT: Adaptive Noise Schedule for Time Series Diffusion Models	Seunghan Lee et.al.	2410.14488	link
2024-10-21	CaTs and DAGs: Integrating Directed Acyclic Graphs with Transformers and Fully-Connected Neural Networks for Causally Constrained Predictions	Matthew J. Vowels et.al.	2410.14485	link
2024-10-18	DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation	Junjie Wu et.al.	2410.14481	null
2024-10-18	Flow-based Sampling for Entanglement Entropy and the Machine Learning of Defects	Andrea Bulgarelli et.al.	2410.14466	null
2024-10-18	FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models	Rui Hu et.al.	2410.14429	null
2024-10-17	Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens	Lijie Fan et.al.	2410.13863	null
2024-10-17	Diffusing States and Matching Scores: A New Framework for Imitation Learning	Runzhe Wu et.al.	2410.13855	link
2024-10-17	Influence Functions for Scalable Data Attribution in Diffusion Models	Bruno Mlodozeniec et.al.	2410.13850	null
2024-10-17	VidPanos: Generative Panoramic Videos from Casual Panning Videos	Jingwei Ma et.al.	2410.13832	null
2024-10-17	DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control	Yujie Wei et.al.	2410.13830	null
2024-10-17	Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning	Xiaodan Xing et.al.	2410.13823	link
2024-10-17	ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution	Junhao Gu et.al.	2410.13807	null
2024-10-17	Probing the Latent Hierarchical Structure of Data via Diffusion Models	Antonio Sclocchi et.al.	2410.13770	null
2024-10-17	Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers	Yuchen Liang et.al.	2410.13746	null
2024-10-17	Improved Convergence Rate for Diffusion Probabilistic Models	Gen Li et.al.	2410.13738	null
2024-10-17	Optimizing Probabilistic Conformal Prediction with Vectorized Non-Conformity Scores	Minxing Zheng et.al.	2410.13735	null
2024-10-18	DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation	Hanbo Cheng et.al.	2410.13726	link
2024-10-17	Movie Gen: A Cast of Media Foundation Models	Adam Polyak et.al.	2410.13720	link
2024-10-18	Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion	Yijun Liang et.al.	2410.13674	link
2024-10-17	Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design	Chenyu Wang et.al.	2410.13643	link
2024-10-16	Geometry-Aware Generative Autoencoders for Warped Riemannian Metric Learning and Generative Modeling on Data Manifolds	Xingzhi Sun et.al.	2410.12779	null
2024-10-16	Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts	Hongcheng Gao et.al.	2410.12777	link
2024-10-16	SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation	Jaehong Yoon et.al.	2410.12761	null
2024-10-16	Signature of Vertical Mixing in Hydrogen-dominated Exoplanet Atmospheres	Vikas Soni et.al.	2410.12737	null
2024-10-16	Counterfactual Generative Modeling with Variational Causal Inference	Yulun Wu et.al.	2410.12730	link
2024-10-16	FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression	Zhenheng Tang et.al.	2410.12707	null
2024-10-16	Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization	Xingqi Wang et.al.	2410.12700	link
2024-10-16	AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing	DuoSheng Chen et.al.	2410.12696	link
2024-10-16	3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation	Dewei Zhou et.al.	2410.12669	link
2024-10-16	Towards Designing Scalable Quantum-Enhanced Generative Networks for Neutrino Physics Experiments with Liquid Argon Time Projection Chambers	Andrea Delgado et.al.	2410.12650	null
2024-10-16	A Robo-Advisor System: expected utility modeling via pairwise comparisons	Bo Chen et.al.	2410.12570	null
2024-10-16	One Step Diffusion via Shortcut Models	Kevin Frans et.al.	2410.12557	link
2024-10-16	Disentangling data distribution for Federated Learning	Xinyuan Zhao et.al.	2410.12530	null
2024-10-16	Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing	Mingce Guo et.al.	2410.12526	null
2024-10-16	MING: A Functional Approach to Learning Molecular Generative Models	Van Khoa Nguyen et.al.	2410.12522	null
2024-10-15	High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion	Junhwa Hur et.al.	2410.11838	null
2024-10-15	On the Effectiveness of Dataset Alignment for Fake Image Detection	Anirudh Sundara Rajan et.al.	2410.11835	null
2024-10-15	Bayesian Experimental Design via Contrastive Diffusions	Jacopo Iollo et.al.	2410.11826	link
2024-10-15	KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities	Hsin-Ping Huang et.al.	2410.11824	null
2024-10-15	Improving Long-Text Alignment for Text-to-Image Diffusion Models	Luping Liu et.al.	2410.11817	link
2024-10-15	SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing	Zhiyuan Zhang et.al.	2410.11815	null
2024-10-16	Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices	Zhiyuan Ma et.al.	2410.11795	null
2024-10-15	G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks	Guibin Zhang et.al.	2410.11782	null
2024-10-15	Technical Report of 1:10 Scale Autonomous Vehicle Robot	Amirhossein Kheiri Holighi et.al.	2410.11746	null
2024-10-15	Probabilistic Principles for Biophysics and Neuroscience: Entropy Production, Bayesian Mechanics & the Free-Energy Principle	Lancelot Da Costa et.al.	2410.11735	null
2024-10-15	Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems	Jason Hu et.al.	2410.11730	null
2024-10-15	Parameter estimation of structural dynamics with neural operators enabled surrogate modeling	Mingyuan Zhou et.al.	2410.11712	null
2024-10-15	Findings of the WMT 2024 Shared Task on Chat Translation	Wafaa Mohammed et.al.	2410.11624	null
2024-10-15	DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment	Wendi Chen et.al.	2410.11584	link
2024-10-15	A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction	Zhouheng Li et.al.	2410.11570	link
2024-10-14	Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models	Jingzhi Bao et.al.	2410.10821	link
2024-10-15	TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models	Mu Cai et.al.	2410.10818	link
2024-10-14	LVD-2M: A Long-take Video Dataset with Temporally Dense Captions	Tianwei Xiong et.al.	2410.10816	link
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-14	HART: Efficient Visual Generation with Hybrid Autoregressive Transformer	Haotian Tang et.al.	2410.10812	link
2024-10-14	TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction	Qingze et.al.	2410.10804	link
2024-10-14	Boosting Camera Motion Control for Video Diffusion Transformers	Soon Yau Cheong et.al.	2410.10802	null
2024-10-14	Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations	Litu Rout et.al.	2410.10792	null
2024-10-14	ControlMM: Controllable Masked Motion Generation	Ekkasit Pinyoanuntapong et.al.	2410.10780	null
2024-10-14	Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation	Youwei Yu et.al.	2410.10766	link
2024-10-14	DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships	Zhang Wan et.al.	2410.10751	null
2024-10-14	CosForce: A Force-Based General Model for Simulating Pedestrian Anticipation and Reaction Mechanisms	Jinghui Wang et.al.	2410.10746	null
2024-10-14	FlexGen: Flexible Multi-View Generation from Text and Image Inputs	Xinli Xu et.al.	2410.10745	null
2024-10-14	Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models	Junyu Chen et.al.	2410.10733	link
2024-10-14	Large Language Models Are Active Critics in NLG Evaluation	Shuying Xu et.al.	2410.10724	null
2024-10-11	SceneCraft: Layout-Guided 3D Scene Generation	Xiuyu Yang et.al.	2410.09049	link
2024-10-11	Linear Convergence of Diffusion Models Under the Manifold Hypothesis	Peter Potaptchik et.al.	2410.09046	null
2024-10-11	PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents	Xiangyu Yin et.al.	2410.09034	link
2024-10-11	Semantic Score Distillation Sampling for Compositional Text-to-3D Generation	Ling Yang et.al.	2410.09009	link
2024-10-11	WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space	Hanchen Wang et.al.	2410.09002	null
2024-10-11	Maximizing the Potential of Synthetic Data: Insights from Random Matrix Theory	Aymane El Firdoussi et.al.	2410.08942	null
2024-10-11	DiffPO: A causal diffusion model for learning distributions of potential outcomes	Yuchen Ma et.al.	2410.08924	null
2024-10-11	An End-to-End Deep Learning Method for Solving Nonlocal Allen-Cahn and Cahn-Hilliard Phase-Field Models	Yuwei Geng et.al.	2410.08914	null
2024-10-11	Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRI	Moritz Piening et.al.	2410.08894	link
2024-10-11	MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices	Mohamed Amine Hamdi et.al.	2410.08855	link
2024-10-14	LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection	Mingjia Li et.al.	2410.08810	link
2024-10-11	Bad Neighbors: On Understanding VPN Provider Networks	Teemu Rytilahti et.al.	2410.08737	link
2024-10-11	5G as Enabler for Industrie 4.0 Use Cases: Challenges and Concepts	M. Gundall et.al.	2410.08726	null
2024-10-11	Investigating Human-Computer Interaction and Visual Comprehension in Text Generation Process of Natural Language Generation Models	Yunchao Wang et.al.	2410.08723	null
2024-10-11	Impact of Surface Reflections in Maritime Obstacle Detection	Samed Yalçın et.al.	2410.08713	link
2024-10-10	LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts	Anh-Quan Cao et.al.	2410.08211	null
2024-10-10	DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models	Xiaoxiao He et.al.	2410.08207	null
2024-10-10	HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation	Shanyan Guan et.al.	2410.08192	null
2024-10-10	DifFRelight: Diffusion-Based Facial Performance Relighting	Mingming He et.al.	2410.08188	null
2024-10-10	RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image	Xiaoxue Chen et.al.	2410.08181	null
2024-10-10	ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion	Zitian Zhang et.al.	2410.08168	link
2024-10-10	DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation	Jiatao Gu et.al.	2410.08159	null
2024-10-10	Progressive Autoregressive Video Diffusion Models	Desai Xie et.al.	2410.08151	link
2024-10-10	Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction	Jarrid Rector-Brooks et.al.	2410.08134	null
2024-10-10	Robust AI-Generated Text Detection by Restricted Embeddings	Kristian Kuznetsov et.al.	2410.08113	link
2024-10-10	LiPO: LiDAR Inertial Odometry for ICP Comparison	Darwin Mick et.al.	2410.08097	null
2024-10-10	Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models	Vinith M. Suriyakumar et.al.	2410.08074	null
2024-10-10	Reversible Decoupling Network for Single Image Reflection Removal	Hao Zhao et.al.	2410.08063	link
2024-10-10	A Target-Aware Analysis of Data Augmentation for Hate Speech Detection	Camilla Casula et.al.	2410.08053	null
2024-10-10	LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion	Marcel Grimmer et.al.	2410.07988	link
2024-10-09	IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation	Xinchen Zhang et.al.	2410.07171	link
2024-10-09	Sylber: Syllabic Embedding Representation of Speech from Raw Audio	Cheol Jun Cho et.al.	2410.07168	link
2024-10-09	AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation	Yukang Cao et.al.	2410.07164	null
2024-10-09	InstructG2I: Synthesizing Images from Multimodal Attributed Graphs	Bowen Jin et.al.	2410.07157	link
2024-10-09	Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis	Bohan Zeng et.al.	2410.07155

