AI & Labor: Global Conversation Map

What is this? This dashboard maps the global discourse on AI and labor -- analyzing how policy statements and academic research address automation, surveillance, collective bargaining, and worker protection across 12 regions. It draws from the Tapestry database, classifying documents by 17 labor sub-themes using English-language keyword heuristics.

v13 Headline: Surveillance theme precision corrected from 51.3% to 87.2%

The prior dashboard (v12) reported surveillance theme heuristic-precision of 51.3%, raising questions about theme retention. A Session 16 post-hoc exploratory reanalysis revealed this was an artifact of a 1,500-character scan window -- the audit script only examined the first 1,500 chars of each extract, missing labor-surveillance context that appeared deeper in the text.

Widening to the full 10,000-character extract body corrected 42 FP→TP flips with zero regressions. Full-population heuristic-precision: 87.2% (102/117 items). Within-sample (n=60): 91.7% (Wilson 95% CI [81.9%, 96.4%]).

What changed v12→v13: Scan window widened from 1,500 to 10,000 chars. What did NOT change: corpus, keyword patterns, classification rules, statement count, all non-surveillance theme classifications.

15 items remain flagged as candidate false positives. 2 are likely heuristic-missed true positives (TUC AI Bill, CCOO algorithmic management guide); 13 are likely genuine false positives requiring human adjudication.

Total Documents

Statements

Policy & position documents

Papers

Academic literature

Countries

Themes

Distinct topic categories

Date Range

Key Findings

Top Themes

Source Breakdown

Region × Theme Heatmap

Explore the intersection of geographic regions and thematic categories. Toggle between raw counts and normalized views to reveal structural patterns.

Normalize

Source

Region × Theme Heatmap

Row Marginals (by Region)

Column Marginals (by Theme)

Geographic Distribution

Countries by Volume

Region Totals

Statements vs Papers by Region

Source Language Distribution

Temporal Trends

Theme Emergence Over Time

Statements vs Papers Over Time

Cumulative Growth

Voice Gap Analysis

Who is shaping the conversation on AI and labor? This tab reveals imbalances between those authoring policy and those most affected.

Speaker Types

Binding Nature

CST Citation Tracker

CST Engagement by Region

Authored vs Affected by Region

Methodology & Limitations

Classification Pipeline: All 17 labor themes are classified using English-language keyword regex patterns applied to title + full extract body (up to 10,000 characters). Non-English documents are systematically disadvantaged, contributing to higher cross-cutting rates in regions with lower English proficiency.

Language Bias: The theme classifier uses English keyword patterns exclusively. Statements in non-English primary languages are almost certainly under-classified on all themes, not just surveillance. This is an epistemological limitation: concepts like “gig work” and “just transition” carry different meanings across linguistic traditions.

Text Extraction: Document extracts are capped at 10,000 characters (EXTRACT_CAP), creating position-dependent classification bias for longer documents. If surveillance context appears beyond 10,000 characters, it will not be detected.

Region Assignment: Items without geographic metadata are assigned to “Global/International” by default via normalize_region(), inflating that category.

Surveillance Theme (v13 update): The surveillance theme uses a two-pass keyword gate with co-occurrence requirements. Three items (STMT-0496, STMT-1551, STMT-2040) are excluded by hard-coded rules. The v12 dashboard reported heuristic-precision of 51.3% based on a 1,500-character scan window. This was a methodological artifact: the window-cap fix (identified post-hoc after inspecting false positives) widened the scan to the full 10,000-character extract body, correcting heuristic-precision to 87.2% (n=117 census, no CI). The 87.2% figure is a corrected-pipeline estimate, not a replication. 15 residual candidate false positives remain for human adjudication. The surveillance theme is composite, encompassing algorithmic management, biometric monitoring, bossware, and workplace monitoring.

Religious Ethics Tracker: Originally developed to measure Catholic Social Teaching (CST) citations. Expanded in v12 to include Islamic, Protestant/Ecumenical, Buddhist, Jewish, and Gandhian/Indian ethical economics sources. Detection rates reflect both corpus composition and keyword specificity; absence of matches does not indicate absence of discourse in a tradition.

Collection Bias: The Tapestry database conducted 12+ targeted ingestion waves that systematically prioritized 2023–2026 documents. Temporal concentration figures reflect collection strategy alongside genuine discourse growth.

Heuristic vs. Human Judgment: All precision figures in this dashboard are heuristic-precision — the fraction of keyword-flagged items judged relevant by the heuristic rules themselves. Only manual domain-expert review can establish true classification precision. Heuristic-precision is a lower bound on true precision only if the heuristic’s errors are biased toward false negatives.

What Changed: v12 → v13

Aspect	v12 (2026-04-12)	v13 (2026-04-13)
Surveillance scan window	1,500 characters	10,000 characters (full EXTRACT_CAP)
Surveillance heuristic-precision	51.3% (117-item census)	87.2% (117-item census, 42 FP→TP flips)
Corpus size	Unchanged (~3,253 statements + ~2,368 papers)
Keyword patterns	Unchanged (17 theme patterns)
Classification rules	Unchanged (two-pass gate, co-occurrence, exclusion list)
Non-surveillance themes	Unchanged
Narrative annotations	S15 framing	S16 corrected (post-hoc exploratory reanalysis)