Stars
This repository contains the supplementary material for the paper "PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies"
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Specification and documentation for Agent Skills
This repository contains the code for FLAWS (Faults Localization Across Writing in Science), a benchmark for evaluating LLMs on their capabilities in error identification and localization in scient…
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing that's as simple as writing Pandas code
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
TAMO: reimagine Table representation as an independent Modality for LLMs
Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.
"Paper2Slides: From Paper to Presentation in One Click"
Implementation of Nougat Neural Optical Understanding for Academic Documents
[TOSEM'25] The official GitHub page for the survey paper "A Survey on Large Language Models for Code Generation".
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
[EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
Paper list of agents for science
[ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>
Code release for "Idiosyncrasies in Large Language Models"
Convert PDF to markdown + JSON quickly with high accuracy
Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University
A game theoretic approach to explain the output of any machine learning model.
The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
A beautiful, simple, clean, and responsive Jekyll theme for academics
AgentFlow: In-the-Flow Agentic System Optimization
An out-of-the-box inference acceleration engine for Diffusion and DiT models


