🛠 Open Source / Software

I value practical and reproducible research. This page highlights open-source libraries, benchmarks, and system tools built by my group and collaborators. Many of these projects support AI risk audit and control across the deployment stack: anomaly and out-of-distribution detection in data, trust and robustness evaluation of foundation models, and runtime audit and control of deployed agent systems, with applications in science and high-stakes domains. Several of these projects have been accepted into the Anthropic Claude for Open Source Program. For all repositories, see my GitHub profile.

External adoption: PyOD is named by OpenAI as expected operational tooling in its Technical Intelligence Analyst role, shipped as a first-class ModelHandler in Apache Beam (Apache Software Foundation), runs the live-traffic alerting subsystem in PostHog (34K+ stars), is the canonical anomaly-detection flavor in MLflow's community-flavor docs, and is embedded in Genentech (Roche) drug-discovery validators. Across the wider ecosystem, 5,493 public repositories and 139 packages depend on PyOD (May 2026 snapshot). DoD CDAO lists PyOD and TrustLLM; ESA OPS-SAT uses PyOD; NIST AI 100-2e2025 and the FLI AI Safety Index cite TrustLLM.

Adoption & Recognition

PyOD — Python Library for Outlier Detection

9,800+ stars · 42M+ downloads · 5,493 dependent repos + 139 packages (GitHub Dependents, May 2026) · GitHub

Type	Evidence	Source
AI Lab	OpenAI Careers names PyOD as expected operational tooling in the Technical Intelligence Analyst job posting Qualifications block: "Have experience with anomaly detection tools, such as PyOD, and discovery processes for surfacing novel or low-prevalence patterns."	OpenAI · snapshot
Space Agency	Selected by ESA for OPS-SAT spacecraft telemetry benchmark (all 30 algorithms)	Nature Sci. Data
U.S. DoD	CDAO Generative AI Responsible AI Toolkit lists PyOD as a Production / High-maturity OOD-detection tool (entry p.49, embedded in Stage 3.1.10 assessment workflow)	ai.mil PDF
EU Project	SEDIMARK Horizon Europe D3.1 (p.18) names PyOD and TODS in the outlier-detection module of the EU data-space toolbox	SEDIMARK D3.1
Gov / Labs	Cited in research papers by authors affiliated with Deutsche Bundesbank, NIH, CDC, RAND, NASA JPL, German DLR and DESY, and the Sandia, Brookhaven, and Argonne national labs, plus multiple Fraunhofer institutes (citing ECOD, COPOD, PyOD, ADBench, TODS, and LSCP)	Audit details
Platform	Apache Software Foundation / Apache Beam (8.5K+ stars) ships a first-class PyOD ModelHandler at `sdks/python/apache_beam/ml/anomaly/detectors/pyod_adapter.py`; Apache Beam underlies Google Cloud Dataflow	apache/beam
Enterprise	PostHog (34K+ stars, YC unicorn product analytics) runs a multi-detector PyOD subsystem at `posthog/tasks/alerts/detectors/pyod_detectors/` for live-traffic alerting (eight algorithm wrappers: KNN, IForest, COPOD, ECOD, OCSVM, LOF, PCA, HBOS)	PostHog/posthog
Platform	MLflow (25.8K+ stars) official community-flavor docs list PyOD as the canonical anomaly-detection flavor with worked KNN-detector example via `mlflavors`	mlflow/mlflow
Pharma	Genentech (Roche) Data Detective embeds PyOD/ADBench in its drug-discovery validator factories (`adbench_validator_method_factory.py`, `adbench_multimodal`, `adbench_ood_inference`)	Genentech/data-detective
Enterprise	Walmart real-time pricing anomaly detection (1M+ daily updates)	KDD 2019
Enterprise	Databricks Kakapo framework for unsupervised outlier detection	Databricks Blog
Enterprise	IQVIA healthcare fraud detection (123K+ pharmacy claims)	SUOD Paper
Enterprise	Ericsson Anomaly Detection Framework (E-ADF) built on PyOD	Ericsson Blog
Patents	12 patents cite PyOD/COPOD/ECOD/LSCP/SUOD (WIPO x2, EU x2, US x4, China x3, Slovakia x1); recent additions include Actimize US20230267468A1 (fraud / anomalous-transaction ML) and Slovak utility model SK2042023U1 (full PyOD detector suite)	Ericsson · Actimize · SK
Encyclopedia	Wikipedia "Anomaly detection" Software section names PyOD; reference list cites Zhao, Nasrullah, Li 2019 JMLR	Wikipedia
Education	Featured in 5 books (Manning, O'Reilly, Apress, Routledge, IntechOpen)	Manning
Education	DataCamp course with dedicated chapter (19M+ platform learners)	DataCamp

ADBench — Anomaly Detection Benchmark

1,000+ stars · NeurIPS 2022 · GitHub

Type	Evidence	Source
Consulting	Deloitte Germany cites ADBench in an AIxAML anti-money-laundering transaction-monitoring solution	Deloitte PDF
Enterprise	Cited in papers with authors affiliated with Microsoft Research, Tencent, Amazon, BlackRock, Visa, Bosch, Siemens, and Ericsson	Audit details
Pharma	Genentech (Roche) Data Detective uses ADBench in `adbench_validator_method_factory.py`, `adbench_multimodal`, and `adbench_ood_inference` validator factories for drug-discovery data validation	Genentech/data-detective

TrustLLM — Trustworthiness Benchmark for LLMs

620+ stars · ICML 2024 · GitHub

Type	Evidence	Source
U.S. Senate	Cited in HSGAC "Hedge Fund Use of Artificial Intelligence" report (footnote 119)	Senate PDF
U.S. DoD	Listed in CDAO Generative AI Responsible AI Toolkit	ai.mil PDF
NIST	Named in NIST AI 100-2e2025 Section 3.6 "Benchmarks for AML Vulnerabilities"	NIST PDF
Policy	Official benchmark in all 3 editions of the FLI AI Safety Index (2024, 2025 x2)	FLI Report · Indicator Sheet
National Lab	Lawrence Livermore National Laboratory feature article; LLNL/DOE SafeAI report cites TrustLLM	LLNL · SafeAI PDF
International	Cited in International AI Safety Report 2026 (citation #881; led by Yoshua Bengio, 100+ experts, 30+ countries)	Report
Media	Featured by 机器之心 (Jiqizhixin) and 澎湃新闻 (The Paper)	机器之心 · 澎湃
Enterprise Editorial	Samsung SDS Insights treats TrustLLM as a flagship LLM trustworthiness evaluation framework in its Korean enterprise editorial; reference list cites arXiv:2401.05561	Samsung SDS

Recent Institutional Visibility — 2026 research releases

Selected institutional pages for newer benchmarks and datasets

Type	Evidence	Source
Institute	Vector Institute highlights TrustGen in its ICLR 2026 research roundup	Vector
Industry Lab	Adobe Research lists FigEdit ("Charts Are Not Images") and the benchmark release	Adobe Research

TDC — Therapeutics Data Commons

1,200+ stars · NeurIPS 2021 · with Harvard & Stanford · GitHub

Type	Evidence	Source
Journal	Published in Nature Chemical Biology (2022)	Nature Chem. Bio.
University	Harvard Medical School feature: "Can AI transform drug discovery?"	HMS News
Science Press	Phys.org syndication of Harvard article	Phys.org
Industry	Amazon Science feature article	Amazon Science
Pharma	Cited by researchers at AstraZeneca, Pfizer, Roche, Novartis, Merck, Sanofi, Eli Lilly	Audit details
Labs	Cited in papers by researchers at Los Alamos and Brookhaven national labs (cheminformatics, model uncertainty) and OpenAI (biomedical reasoning)	Audit details

DoxBench — Geolocation Privacy Leakage Benchmark

ICLR 2026 · GitHub

Type	Evidence	Source
Policy	Cited by Privacy International in "Nowhere to Hide? Privacy Risks and Policy Implications of AI Geolocation" (p.28, footnote 56)	Report
Chinese Media	机器之心Pro reporting via Sina names "南加州大学教授赵越（Yue Zhao）团队", paper title "Doxing via the Lens", and the arXiv link	Sina

Full Project Portfolio

Flagship project pages: agent-style, anywhere-agents, agent-audit, Aegis, PyOD 3.

Sort by:

No projects listed yet. Add entries to data/open-source.json to populate this page.