56% of relevant papers mention the search terms only in the full text, not in the title or abstract
Traditional Search
Title + Abstract Only
2,821
44%
Papers Found
OpenAlex Full-Text
Title + Abstract + Full-Text
6,329
100%
Papers Found
Hidden Literature
3,508
56% of Relevant Papers
Only mentioned in methodology, case studies,
comparative analyses, or body text
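The comparison above can be approximated with two count queries against the OpenAlex works endpoint. This is a minimal sketch, not the exact query used for these slides: the search phrase is a hypothetical placeholder, and it assumes the `title_and_abstract.search` filter and the general `search` parameter (which also covers indexed full text) behave as described in the OpenAlex documentation at the time of writing.

```python
import json
import urllib.parse
import urllib.request

OPENALEX_WORKS = "https://api.openalex.org/works"


def build_count_url(query: str, scope: str) -> str:
    """Build a works URL for one search scope, asking for a single result per page.

    scope="title_abstract" restricts matching to titles and abstracts;
    scope="all" uses the general search parameter (assumed to include full text).
    """
    if scope == "title_abstract":
        params = {"filter": f"title_and_abstract.search:{query}"}
    elif scope == "all":
        params = {"search": query}
    else:
        raise ValueError(f"unknown scope: {scope}")
    params["per-page"] = "1"  # only the count in the response metadata is needed
    return f"{OPENALEX_WORKS}?{urllib.parse.urlencode(params)}"


def fetch_count(query: str, scope: str) -> int:
    """Call the API and read meta.count (needs network access)."""
    with urllib.request.urlopen(build_count_url(query, scope)) as response:
        return json.load(response)["meta"]["count"]


def hidden_count(title_abstract_count: int, all_count: int) -> int:
    """Papers found only when full text is searched as well."""
    return all_count - title_abstract_count


# With the figures from the slide: 6,329 total minus 2,821 title/abstract hits.
print(hidden_count(2821, 6329))  # 3508
```

Calling `fetch_count("russian policy", "title_abstract")` and `fetch_count("russian policy", "all")` would give the two live counts for that placeholder phrase; the numbers will drift as OpenAlex indexes more full text.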
Why Are They Hidden?
Comparative Studies
Papers comparing multiple countries mention Russian policy in analysis sections
Methodology Papers
Research methods papers using Russian policy as example cases
Historical Analyses
Broader historical works with sections on Russian policy
Regional Studies
Area studies mentioning Russian policy in regional context
Book Chapters
Edited volumes with Russian policy discussed in specific chapters
The Zotero Challenge
Traditional Zotero Import Limitations
Browser Extension
Only sees visible results
Metadata Matching
Relies on titles/abstracts
Result
Miss 56% of papers
OpenAlex → Zotero Solution
API Export
Get ALL results
RIS/BibTeX Format
Convert to Zotero format
Result
Capture 100% of papers
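The two steps of the OpenAlex → Zotero route (export every result, then convert to RIS) might look roughly like this. It is a sketch under stated assumptions: `fetch_page` is a hypothetical helper you would supply to request one page of results, the cursor paging convention (`cursor=*`, then `meta.next_cursor`) and the work fields used here follow the OpenAlex documentation as I understand it, and the RIS mapping covers only a handful of tags.

```python
from typing import Callable, Iterator

# Partial mapping from OpenAlex work types to RIS reference types;
# anything not listed falls back to the generic type GEN.
RIS_TYPES = {"article": "JOUR", "book-chapter": "CHAP", "book": "BOOK"}


def iter_all_works(fetch_page: Callable[[str], dict]) -> Iterator[dict]:
    """Yield every work by following cursor paging until no cursor remains.

    fetch_page(cursor) is a hypothetical callable that requests one page
    (for example with cursor=<value> and per-page=200) and returns parsed JSON.
    """
    cursor = "*"  # cursor paging starts with cursor=*
    while cursor:
        page = fetch_page(cursor)
        results = page.get("results", [])
        if not results:
            break
        yield from results
        cursor = page.get("meta", {}).get("next_cursor")


def work_to_ris(work: dict) -> str:
    """Convert one OpenAlex work record into a minimal RIS entry."""
    lines = [f"TY  - {RIS_TYPES.get(work.get('type'), 'GEN')}"]
    if work.get("title"):
        lines.append(f"TI  - {work['title']}")
    for authorship in work.get("authorships", []):
        name = authorship.get("author", {}).get("display_name")
        if name:
            lines.append(f"AU  - {name}")
    if work.get("publication_year"):
        lines.append(f"PY  - {work['publication_year']}")
    # OpenAlex stores DOIs as full URLs; RIS expects the bare identifier.
    doi = (work.get("doi") or "").removeprefix("https://doi.org/")
    if doi:
        lines.append(f"DO  - {doi}")
    lines.append("ER  - ")
    return "\n".join(lines)


def export_ris(works: Iterator[dict]) -> str:
    """Join all entries into one RIS document for import into Zotero."""
    return "\n".join(work_to_ris(work) for work in works) + "\n"
```

Writing the output of `export_ris(iter_all_works(fetch_page))` to a `.ris` file gives something Zotero can take in through its import dialog. Keeping the network call behind `fetch_page` makes the paging logic easy to check offline.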
Why We Need Everything: Dual Purpose
Reference Management
• Citations & footnotes
• Bibliographies
• Literature reviews
• Academic writing
LLM Corpus Analysis
• Full-text processing
• Pattern discovery
• Taxonomic annotation
• Knowledge graphs
Precision × Recall = Comprehensive Corpus
Maximum relevant papers × Minimum noise = Optimal LLM training data
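For readers who want the two terms pinned down: precision is the share of retrieved papers that are relevant (low noise), and recall is the share of relevant papers that were retrieved (few misses). The sketch below computes both from sets of paper IDs; the IDs are made-up placeholders, and in practice the "relevant" set would have to come from a manually checked sample.

```python
def precision_recall(retrieved: set[str], relevant: set[str]) -> tuple[float, float]:
    """Return (precision, recall) for a retrieved set against a relevant set."""
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical toy example: four relevant papers, three retrieved, one of them noise.
relevant = {"W1", "W2", "W3", "W4"}
retrieved = {"W1", "W2", "W9"}
precision, recall = precision_recall(retrieved, relevant)
print(round(precision, 2), recall)  # 0.67 0.5
```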
Next Workshop Sessions: LLM-Powered Analysis
1
TODAY: OpenAlex & Zotero Integration
• Complete corpus collection (6,196 papers)
• Dual-purpose workflow for reference management
• Export to RIS/BibTeX for LLM analysis
2
Building Rich Taxonomies with LLMs
• Multi-level classification systems
• Multiple perspectives (theoretical, methodological, thematic)
• Emergent categories from 6,196 papers
• LLM-suggested hierarchies
3
Intelligent Corpus Annotation
• Chunk documents into segments
• Apply taxonomic labels via LLMs
• Create knowledge graphs
• Discover hidden connections
From 6,196 papers →
Structured knowledge
Including the 3,508 "hidden" papers only findable via full-text search
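The "chunk documents into segments" step in session 3 could be done in many ways; one common approach is fixed-size word windows with some overlap, so that a passage split at a boundary still appears whole in one chunk. The sketch below illustrates that idea only. The function name and the default sizes are assumptions for illustration, not settings from the workshop.

```python
def chunk_words(text: str, size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into chunks of up to `size` words, each sharing `overlap` words with the previous one."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("size must be positive and overlap must satisfy 0 <= overlap < size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reaches the end of the text
    return chunks


sample = " ".join(f"w{i}" for i in range(10))
for chunk in chunk_words(sample, size=4, overlap=1):
    print(chunk)
# w0 w1 w2 w3
# w3 w4 w5 w6
# w6 w7 w8 w9
```

Each chunk can then be passed to whatever labelling step is used, with the chunk index kept alongside the paper ID so labels can be traced back to their source.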
Workshop Resources & Materials
Everything you need in one place
Dropbox Folder: /2603 - Boston/
Main Workshop Materials
├── 260311-13 Part 1 - Setting the Scene.pptx (117MB)
├── 260311-13 Part 2 - Building Corpora.pptx (74MB)
├── 260311-13 Part 3 - Building Taxonomies.pptx (22MB)
├── 260311-13 Part 4 - Using LLMs.pptx (33MB)