AI Monitors

AI Monitors

Links last updated January 14, 2026.

Capability - Towards human-level intelligence and beyond

How close are we to human-level intelligence and autonomous AI?

AIStats.live: Real-Time Comparison of AI progress vs. Human Performance Benchmarks
Monitor and analyze live benchmarks tracking the evolving capabilities of artificial intelligence against human performance across key domains. Stay informed on cutting-edge AI progress and its real-world impact.

Leaderboards

LLM Leaderboard - Comparison of over 100 AI models from OpenAI, Google, DeepSeek & others | Artificial Analysis
Comparison and ranking the performance of over 100 AI models (LLMs) across key metrics including intelligence, price, performance and speed (output speed - tokens per second & latency - TTFT), context window & others.
LLM Leaderboard 2025
This AI leaderboard shows comparison of capabilities, price and context window for leading commercial and open-source LLMs, based on the benchmark data provided in technical reports in 2025.

User sentiment - ELO ranking

Chatbot Arena +
This leaderboard is based on the following benchmarks. Chatbot Arena - a crowdsourced, randomized battle platform for large language models (LLMs). We use 5M+ user votes to compute Elo ratings. AAII - Artificial Analysis Intelligence Index v3 aggregating 10 challenging evaluations. ARC-AGI - Artificial General Intelligence benchmark v2 to measure fluid intelligence.

Safety Scoring

Detailed safety ranking of Frontier AI developers, and breakdown on various safety metrics.

Read up on different AI risks with the easy-to-read guide made by Center for AI Safety

By reputable Future of Life Institute:

AI Safety Index Winter 2025 - Future of Life Institute
The Winter 2025 edition of our AI Safety Index, in which AI experts rate eight leading AI companies on key safety and security domains.

STANFORD Foundation Model Transparency Index (no live tracking):
Tracking many major companies (not just leading AI developers) on Transparency scoring

Foundation Model Transparency Index

Hallucination (and confidence in wrong answers): Omniscience Index

AA-Omniscience: Knowledge and Hallucination Benchmark | Artificial Analysis
Compare AI model performance on AA-Omniscience: Knowledge and Hallucination Benchmark. A benchmark measuring factual recall and hallucination across various economically relevant domains.

AI dominance - And the race to supremacy

AI Index | Stanford HAI
The mission of the AI Index is to provide unbiased, rigorously vetted, and globally sourced data for policymakers, researchers, journalists, executives, and the general public to develop a deeper understanding of the complex field of AI. To achieve this, we track, collate, distill, and visualize dat

Oxford insights 2025 Index

2024 Government AI Readiness Index
Government AI Readiness Index 2024

Oxford Insight's Index 2024 here

Incidents

By company: AI Incident Database

Note: OpenAI continues to lead in number of incidents, followed by Google

Entities
Entities involved in AI Incidents

Incidents over time Global/Country: MIT AI Incident Tracker & OECD

MIT AI Incident Tracker
The MIT AI Incident Tracker project classifies real-world, reported incidents by risk domain, causal factors, and harm caused.
OECD AI Policy Observatory Portal