1d ago

Databricks AI Research forms team for AI agent evaluation

0

Julia Neagu leads Databricks AI Research in assembling a team to measure and improve AI agents operating on enterprise data at scale. The initiative focuses on converting evaluation results into better performance across development, training, and production deployment. Work targets complex analytical tasks in domains such as biotech and finance.

Original post

I'm building a new team at @databricks AI Research and we're hiring. We're focused on one of the hardest open problems in AI right now: how do you measure and continuously improve agents that operate on enterprise data at scale. We're looking for founding engineers to build the flywheel that turns evaluation results directly into better agents — from development and training all the way to production. If you want to work on problems that actually matter at the frontier of AI research, I'd love to talk. Link in comments 👇

12:38 PM · May 15, 2026 View on X
Reposted by

This is a fantastic team. Check it out if you want to help build agents and models that reliably answer the most challenging analytical questions in biotech, finance, etc for our customers.

Julia NeaguJulia Neagu@julianeagu

I'm building a new team at @databricks AI Research and we're hiring. We're focused on one of the hardest open problems in AI right now: how do you measure and continuously improve agents that operate on enterprise data at scale. We're looking for founding engineers to build the flywheel that turns evaluation results directly into better agents — from development and training all the way to production. If you want to work on problems that actually matter at the frontier of AI research, I'd love to talk. Link in comments 👇

7:38 PM · May 15, 2026 · 157.9K Views
8:33 PM · May 15, 2026 · 12.2K Views

An incredibly exciting opportunity. Come work with us! Lots of fun and impactful problems to work on.

Julia NeaguJulia Neagu@julianeagu

I'm building a new team at @databricks AI Research and we're hiring. We're focused on one of the hardest open problems in AI right now: how do you measure and continuously improve agents that operate on enterprise data at scale. We're looking for founding engineers to build the flywheel that turns evaluation results directly into better agents — from development and training all the way to production. If you want to work on problems that actually matter at the frontier of AI research, I'd love to talk. Link in comments 👇

7:38 PM · May 15, 2026 · 157.9K Views
8:07 PM · May 15, 2026 · 2.9K Views
Databricks AI Research forms team for AI agent evaluation · Digg