In case you missed it
Former Anthropic and DeepMind lead Behnam Neyshabur launches Mirendil with $200 million to automate frontier AI engineering
#124 @BNEYSHABUR | #102 @_AROHAN_ | #76 @DUSTINVTRAN | #53 @YEJINCHOINKA | #2 @JEFFDEAN | +42 more
…l-Intelligence/openpi | 4h ago | Provides open-source π₀-series vision-language-action models, checkpoints, and training code for robotics. | SD159012.5k stars
mdn/mcp | 11h ago | Implements an experimental MCP server exposing MDN search, docs, and browser compatibility data to LLMs. | SW201103 stars
…/legal-research-bench | 18h ago | Evaluates LLMs on using tools to research and answer complex legal questions about statutes, regulations, and case law. | AD10661 stars
googleworkspace/cli | 1d ago | Provides unified CLI access to Google Workspace APIs such as Drive, Gmail, and Calendar, dynamically built from the Discovery Service, with AI agent skills. | AS29027.8k stars
huggingface/OpenEnv | 1d ago | Interfaces isolated execution environments for agentic RL training via Gymnasium-style step/reset/state APIs. | JM2782.4k stars
…rinceton/ceobench-src | 1d ago | Simulates 500-day startup operations to benchmark long-horizon LLM agents via business databases, tools, and market events. | 🎭1014ZL80229 stars
…a/LlamaLanguageModels | 1d ago | Implements Apple's Foundation Models LanguageModel API for llama.cpp GGUF models in Swift. | 🎭1014ME8614 stars
switchangel/breaks | 1d ago | Hosts normalized two-bar breakbeat WAV samples plus a Strudel config file. | 🎭1014AH139890 stars
openai/simple-evals | 1d ago | Evaluates language models on benchmarks like MMLU, GPQA, MATH, and SimpleQA using zero-shot chain-of-thought prompting. | 🎭1014BA1444.5k stars
StarTrail-org/LEANN | 1d ago | Implements graph-based vector indexing with selective recomputation for compact on-device RAG. | 🎭1014MZ44112.6k stars