Skip to content
View xanhho's full-sized avatar

Block or report xanhho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository contains the supplementary material for the paper "PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies"

TypeScript 2 Updated Oct 5, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,515 1,157 Updated Nov 21, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,275 1,236 Updated Nov 4, 2025

Specification and documentation for Agent Skills

Python 3,840 174 Updated Dec 20, 2025

This repository contains the code for FLAWS (Faults Localization Across Writing in Science), a benchmark for evaluating LLMs on their capabilities in error identification and localization in scient…

Python 3 Updated Dec 15, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,482 465 Updated Dec 18, 2025

AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that's as simple as writing Pandas code

Python 1,512 132 Updated Dec 11, 2025

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 13,504 1,115 Updated Dec 25, 2025

TAMO: reimagine Table representation as an independent Modality for LLMs

Python 5 1 Updated May 23, 2025

Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.

Python 47 6 Updated Mar 17, 2025

"Paper2Slides: From Paper to Presentation in One Click"

Python 2,554 348 Updated Dec 19, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,766 622 Updated Feb 21, 2025

[TOSEM'25] The official GitHub page for the survey paper "A Survey on Large Language Models for Code Generation".

178 8 Updated Jul 13, 2025

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

Jupyter Notebook 927 118 Updated Nov 5, 2025

[EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

282 36 Updated Nov 5, 2025

Paper list of agent for science

176 12 Updated Dec 8, 2025

[ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>

Python 49 5 Updated Nov 12, 2025

Code release for "Idiosyncrasies in Large Language Models"

Python 51 7 Updated Jul 21, 2025
Python 1,486 158 Updated Nov 15, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 30,582 2,086 Updated Dec 25, 2025

Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University

272 20 Updated Dec 21, 2025

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 24,866 3,463 Updated Dec 11, 2025

The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts

Python 1 Updated Nov 17, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,530 212 Updated Dec 27, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,784 12,645 Updated Dec 24, 2025

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,446 187 Updated Dec 17, 2025

An out-of-the-box inference acceleration engine for Diffusion and DiT models

C++ 60 2 Updated Mar 21, 2025
Next