Doubling Rate Correction
Prompt
Change all references AI doubling rate from 3-4 months to 7 months and cite the METR paper in the references
Response
AI Doubling Rate Correction Session Log
Date: 2025-11-16 Task: Update all AI doubling rate references from 3-4 months to 7 months and cite METR paper Status: ✓ Complete
Session Overview
Updated all references to the AI doubling rate throughout the textbook to reflect the latest research from METR (Model Evaluation and Threat Research). Changed the doubling period from "3-4 months" or "3.4 months" to "7 months" based on the METR paper "Measuring AI Ability to Complete Long Tasks" published March 19, 2025.
Step 1: Initial Search
Search Performed: Searched for all mentions of "doubling", "3-4 months", and "3.4 months" across the documentation.
Files Found with References:
1. docs/glossary.md - AI Doubling Rate definition
2. docs/chapters/01-foundations-ai-nlp/quiz.md - Quiz question answer
3. docs/chapters/01-foundations-ai-nlp/index.md - Multiple instances in chapter content
4. docs/faq.md - FAQ answer
5. Various log files (not updated, as they're historical records)
Total Instances Found: 6 instances requiring updates in 4 files
Step 2: METR Paper Research
Web Search Query: "METR AI doubling rate 7 months task completion paper"
Paper Found: - Title: Measuring AI Ability to Complete Long Tasks - Publication Date: March 19, 2025 - Publisher: METR (Model Evaluation and Threat Research) - URL: https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/ - arXiv: https://arxiv.org/abs/2503.14499
Key Authors: - Thomas Kwa - Ben West - Joel Becker - Amy Deng - Plus 17 additional contributors
Main Finding: The 50% task completion time horizon has been growing exponentially from 2019–2025, with a doubling time of approximately seven months.
Key Metric: "50%-task-completion time horizon" - the time humans typically take to complete tasks that AI models can complete with 50% success rate.
Current State: Current frontier AI models (like Claude 3.7 Sonnet) have a 50% time horizon of around 50 minutes.
Future Projection: If trends persist 2-4 years, "generalist autonomous agents will be capable of performing a wide range of week-long tasks."
Step 3: Reference Update
Status: METR paper was already added to /docs/references.md
Location: Top of references file in new "AI Capability Increases" section
Citation Format:
1 2 | |
Step 4: Content Updates
Update 1: docs/glossary.md (Line 33)
Original:
1 | |
Updated:
1 | |
Rationale: Updated to reflect METR research findings
Update 2: docs/chapters/01-foundations-ai-nlp/quiz.md (Line 72)
Original:
1 | |
Updated:
1 | |
Rationale: Quiz answer must reflect current research data
Update 3: docs/chapters/01-foundations-ai-nlp/index.md (Line 116)
Original:
1 | |
Updated:
1 | |
Changes: - "3.4 months" → "7 months" - "OpenAI" → "METR" (proper attribution to source)
Update 4: docs/chapters/01-foundations-ai-nlp/index.md (Lines 161-163)
Original:
1 2 3 4 | |
Updated:
1 2 3 4 | |
Changes: - "3.4 months from 2012-2018" → "7 months from 2019-2025" (updated period and rate) - "7x" → "3.4x" (recalculated: 24 months ÷ 7 months ≈ 3.4x) - Updated third bullet to reflect current state rather than historical milestone
Update 5: docs/chapters/01-foundations-ai-nlp/index.md (Line 571)
Original:
1 | |
Updated:
1 | |
Rationale: Key concepts summary must be accurate
Update 6: docs/faq.md (Line 225)
Original:
1 | |
Updated:
1 | |
Rationale: FAQ must provide accurate information
Step 5: Verification
Final Search Performed:
1 | |
Result:
1 | |
Files Not Updated:
Log files in /logs/ directory were intentionally not updated as they represent historical records of previous sessions.
Summary Statistics
Files Updated: 4 Total Edits: 6 Old Value: 3-4 months / 3.4 months New Value: 7 months Research Period Updated: 2012-2018 → 2019-2025 Source Attribution Updated: OpenAI → METR
Key Changes to Metrics
Moore's Law Comparison Recalculation
Old Calculation: - Moore's Law: 24 months - AI Doubling: 3.4 months - Ratio: 24 ÷ 3.4 ≈ 7x faster
New Calculation: - Moore's Law: 24 months - AI Doubling: 7 months - Ratio: 24 ÷ 7 ≈ 3.4x faster
Note: While the AI doubling rate is slower than previously reported, it still significantly exceeds Moore's Law's improvement rate.
Important Context
Why the Rate Changed
The original 3.4-month doubling rate came from earlier research (2012-2018 era) that measured different aspects of AI progress, particularly focused on computational resources allocated to training.
The new 7-month rate from METR measures a different, more practical metric: - Task completion capability: Length of tasks AI can complete with 50% probability - Time period: 2019-2025 (includes GPT-2 through current frontier models) - Methodology: Empirical testing on diverse real-world tasks - Current capability: ~50-minute tasks for frontier models (Claude 3.7 Sonnet)
Implications for the Course
- Still Rapid Growth: 7-month doubling is still extremely fast compared to historical technological progress
- More Conservative: Provides more realistic expectations for students
- Better Grounded: Based on recent, empirical research with transparent methodology
- Practical Focus: Measures actual capability on real tasks, not just computational resources
Related Content Not Updated
Why Log Files Were Not Updated
Files in the /logs/ directory represent historical records and were intentionally preserved:
- logs/quiz.md - Historical record of quiz generation
- logs/ch01.md - Historical record of chapter 1 creation
- logs/references.md - Historical record of reference generation session
These files document the state of knowledge at the time they were created and serve as audit trails.
Verification Checklist
- [x] METR paper added to references (already present)
- [x] All "3.4 months" instances updated to "7 months"
- [x] All "3-4 months" instances updated to "7 months"
- [x] Source attribution updated (OpenAI → METR)
- [x] Time period updated (2012-2018 → 2019-2025)
- [x] Moore's Law comparison recalculated (7x → 3.4x)
- [x] Glossary updated
- [x] Quiz updated
- [x] Chapter content updated (3 instances)
- [x] FAQ updated
- [x] Final verification completed (no remaining instances)
Tools Used
- Grep - Pattern search across files
- WebSearch - Found METR paper
- WebFetch - Verified METR paper details and extracted key information
- Read - Read file sections for context
- Edit - Updated file content
- Bash - Final verification
Next Steps Recommended
For Course Maintenance:
- Monitor METR Research: Check for updates to the 7-month figure annually
- Update Charts: If any MicroSims or charts show the 3.4-month rate, update them
- Review Videos/Slides: Check any external course materials for old rate
- Student Communication: If course is currently running, notify students of the update
For Future Accuracy:
- Primary Sources: Always cite specific research papers for metrics
- Date Context: Include time periods when citing rates (e.g., "7 months from 2019-2025")
- Regular Reviews: Review key metrics annually as AI field evolves rapidly
- Multiple Metrics: Consider discussing different doubling rate metrics (compute vs. capability vs. task length)
References
Primary Source: - Measuring AI Ability to Complete Long Tasks - METR, March 19, 2025 - arXiv version
Related Resources: - METR Analysis Code (GitHub) - METR Evaluation Infrastructure
Session log completed: 2025-11-16 All AI doubling rate references updated from 3-4 months to 7 months Total files modified: 4 Total edits: 6 Verification: ✓ Complete