Research
Research and engineering notes on uncertainty quantification, risk-aware machine learning, and trustworthy AI from the Themis AI team.
-
Spotting the Unfamiliar: Out-of-Distribution Detection with Uncertainty
Models fail most dangerously on inputs they've never seen. Epistemic uncertainty flags those out-of-distribution cases in real time — so the system can escalate, abstain, or adapt.
-
Managing Risk in Production AI Systems
Why production AI needs risk measurement, not just predictions — and how to turn a model's uncertainty into enforceable risk policies.
-
NIST's Risk Management Framework and Themis AI
Examining NIST's Risk Management Framework.
-
Catching LLM Hallucinations with Uncertainty
LLMs fail without warning. A separate uncertainty estimate lets a model abstain, escalate, or defer instead of confidently making things up.
-
Drug Discovery Cost Reduction with Uncertainty-Guided Predictions
-
Uncertainty-Aware Human Intervention for Autonomous Vehicles
-
Risk-Aware Hallucination Detection for Arbitrary Generative Models
-
Uncertainty-Aware Language Modeling for Selective Question Answering
Presenting uncertainty-aware LLMs capable of estimating uncertainty with every prediction.
-
Probability vs Confidence
At Themis AI, we are building technology to make AI trustworthy and robust.
-
Robust and Trustworthy Deep Learning (Part 3): Themis AI
Themis AI's cutting-edge technological advancements in robust and trustworthy deep learning.
-
Robust and Trustworthy Deep Learning (Part 2): Uncertainty
Themis AI's cutting-edge technological advancements in robust and trustworthy deep learning.
-
Robust and Trustworthy Deep Learning (Part 1): Bias
Themis AI's cutting-edge technological advancements in robust and trustworthy deep learning.
-
Preliminary Steps Towards Risk-Aware Image Generation
Using Capsa to automatically evaluate the quality of images generated by Stable Diffusion.