Recent Stars

Jun 10, 2026

Tracking:2,121|Stars today:14|Star contributors:12

Browser extension that captures M3U8 HLS live streams and writes them to local disk

chandler-stimson/live-stream-downloader34556m

It uses browser network interception to collect segmented video chunks from live playlists and concatenates the fragments into a single file on the user's machine. The technique mirrors long-standing download utilities and browser extensions without introducing new primitives, confining adoption to occasional users who need to archive specific web streams.

AI Score
1
Breakout Potential
3
Novelty
2
Jeremy HowardJH#42|@JEREMYPHOWARD

Manage shared AI coding agent skills in Notion databases and sync them locally across tools

brianlovin/notion-skills671h

The CLI installs a Notion-backed store that maps database pages to SKILL.md files and symlinks them into agent-specific directories for Claude, Cursor and others. This centralizes collaboration on skills without git workflows, appealing mainly to teams already using multiple AI coding agents who want live editing and selective installs rather than broad developer adoption.

AI Score
9
Breakout Potential
5
Novelty
6
Geoffrey LittGL#1867|@GEOFFREYLITT

Downloader and interactive viewer for the Nymeria multimodal egocentric human motion dataset

facebookresearch/nymeria_dataset2041h

The repository supplies Python CLIs that fetch the 80 TB synchronized multi-device recordings, MHR/SMPL meshes, 3D object boxes, and language annotations, then render them together in a real-time viewer. Its unprecedented scale and first-of-kind egocentric multimodal capture make it a foundational resource for embodied-AI and AR/VR research groups rather than a general-purpose developer tool.

AI Score
9
Breakout Potential
4
Novelty
8
Andrew Carr 🤸AC#275|@ANDREW_N_CARR

Trajectory-refined distillation trainer for on-policy LLM distillation with teacher-guided rollout revision

louieworth/trd73h

The implementation builds on verl to prepare rollouts, apply KL losses at the trajectory level, and produce refined targets y r for both OPD and OPSD pipelines across math and code tasks. The method introduces a new output-refinement stage rather than further token-level loss tweaks, so it will be adopted mainly by research teams already running large-scale distillation experiments.

AI Score
10
Breakout Potential
4
Novelty
8
Brandon AmosBA#158|@BRANDONDAMOS

FUSE filesystem that infers file contents from LLM latent space using only filenames

philipl/inferencefs223h

It mounts via FUSE so the source tree supplies only names while an LLM backend such as Gemini or Claude is prompted on every read to emit the corresponding bytes, with an in-memory LRU cache and base64 handling for binaries. The approach humorously extends the πfs concept into parametric memory, attracting attention from developers who enjoy infrastructure parodies but offering little path to broad production use.

AI Score
9
Breakout Potential
3
Novelty
7
Samuel "curry-howard fanboi" AinsworthS"#1689|@SAMUELAINSWORTH

Personal GitHub profile README for NVIDIA research scientist in generative 3D and world models

lorraine2/lorraine217h

The repository holds a static profile page listing academic credentials at the Spatial Intelligence Lab along with selected publications on LLaMA-Mesh, amortized text-to-3D, and graph metanetworks. Standard personal academic pages serve only narrow professional networking within specialized AI research circles and introduce no reusable technique or broadly applicable artifact.

AI Score
9
Breakout Potential
1
Novelty
1
Jonathan LorraineJL#1230|@JONLORRAINE9

PyTorch framework for large-scale training of distilled diffusion models across image and video tasks

NVlabs/FastGen8047h

It implements modular training pipelines for consistency models, distribution matching distillation, self-forcing and related methods with DDP/FSDP2 support on architectures such as EDM, SDXL, Flux and CogVideoX. The codebase aggregates established acceleration techniques into a single extensible toolkit that primarily serves diffusion researchers rather than general developers.

AI Score
10
Breakout Potential
4
Novelty
6
Andrea Tagliasacchi @CVPRAT#1602|@TAIYASAKI

OmniDreams autoregressively generates real-time multi-camera photorealistic video from single RGB frames and HD-map conditioning

nv-tlabs/omni-dreams1757h

It ingests an initial frame plus per-chunk text prompts, coarse HD maps, and trajectory poses to produce video chunks that are fed back as input for long rollouts. The approach refines existing world-model techniques with driving-specific distillation for simulation use cases, limiting adoption to autonomous-vehicle teams rather than general video or robotics workflows.

AI Score
10
Breakout Potential
4
Novelty
7
Jonathan LorraineJL#1230|@JONLORRAINE9

High-performance inference and serving library for autoregressive video and world models

NVIDIA/flashdreams2727h

It supplies specialized runners, multi-GPU pipelines, and configuration tooling that target models such as Wan2.1 variants, Cosmos-Predict, and OmniDreams on H100-class GPUs. The narrow hardware footprint and domain focus on real-time closed-loop simulation limit adoption to specialized teams in robotics and autonomous-vehicle research rather than broad generative-AI use.

AI Score
10
Breakout Potential
3
Novelty
6
Jonathan LorraineJL#1230|@JONLORRAINE9

Agent skill that turns raw datasets into verifiable multimodal stories with evidence tracing

QinghongLin/data2story-skill238h

The skill orchestrates a fixed seven-role pipeline of specialized agents that sequentially research context, profile data, craft narrative, generate visuals, emit tagged HTML, audit layout, and build an interactive evidence viewer. This structured multi-agent workflow is novel for data journalism yet targets a narrow audience of analysts and journalists who already work inside coding agents, limiting mass adoption.

AI Score
9
Breakout Potential
4
Novelty
7
Pan LuPL#1353|@LUPANTECH

Automated pipeline constructing 879 lifecycle-aware skill poisoning attacks across 12 risk types for LLM agents

OSU-NLP-Group/SkillHarm58h

Coding agents guided by natural-language harnesses execute target selection, iterative payload design with safety-gated refinement, and reviewer-based quality filtering to produce fixed-payload and self-mutating attack samples that persist across agent sessions. The cross-session mutation technique extends prior poisoning concepts into agent workflows and is therefore most relevant to AI safety and red-teaming teams.

AI Score
10
Breakout Potential
4
Novelty
8
Yu SuYS#412|@YSU_NLP

Terminal AI coding agent with multi-provider LLM support and built-in agentic tools

QwenLM/qwen-code25.1k9h

It runs as a Node.js CLI that connects to OpenAI-compatible, Anthropic, or Gemini endpoints through a configurable settings file and exposes Skills plus SubAgents for repository-scale tasks. The approach follows the established pattern of terminal-first LLM coding tools rather than introducing new primitives, limiting its audience to developers already embedded in command-line workflows.

AI Score
10
Breakout Potential
6
Novelty
4
Sebastian RaschkaSR#167|@RASBT

Workflow layer that adds agent teams, structured prompts, and durable state to OpenAI Codex CLI

Yeachan-Heo/oh-my-codex30.7k9h

OMX installs as a global npm package and wraps Codex CLI sessions with predefined commands such as $deep-interview, $ralplan, and $ultragoal that write plans, logs, and artifacts into a .omx directory while optionally launching tmux-based team runtimes. The approach remains a conventional prompt-and-script orchestration layer whose audience stays limited to existing Codex CLI users rather than becoming a general AI-.

AI Score
9
Breakout Potential
3
Novelty
4
Jeremy Nixon - Automating AI ResearchJN#1346|@JVNIXON

Framework for multitask data attribution on multilingual instruction tuning and math reasoning using SFT and GRPO

uiuctml/convex_data_valuation110h

It implements gradient, kernel mean matching, task vector, and compressed-sensing datamodel attribution methods inside a configurable trainer that runs on the Aya and translated GSM8K datasets. The approach extends existing valuation techniques to post-training regimes rather than pre-training, so adoption is likely limited to research groups already working on data attribution for instruction-tuned models.

AI Score
9
Breakout Potential
4
Novelty
6
Han ZhaoHZ#1228|@HANZHAO_ML