DOCUMENTATION

Data Guide

How to prepare your dataset for Stratensight analysis.

Supported data sources

Stratensight automatically detects your export format and maps columns.

Espacenet / EPO

✅ Full supportHigh confidence

Official EPO database, recommended starting point

Google Patents

✅ Full supportHigh confidence

Free access, excellent for broad technology coverage

LexisNexis PatentSight

✅ Full supportHigh confidence

Professional export with all 4 scores and family dedup

Questel Orbit Intelligence

✅ Full supportHigh confidence

Full 4 scores with FAMPAT family dedup. Covers 300+ patent authorities.

Derwent Innovation

✅ Full supportHigh confidence

All 4 scores available with strong citation data

PatSnap

✅ Full supportHigh confidence

All 4 scores available

Generic CSV

⚡ Basic supportVariable confidence

Any CSV with patent data, core scores available

Required fields

FIELD NAMEDESCRIPTION
patent_id / Publication NumberUnique identifier for each patent
titlePatent title (used for clustering)
abstractAbstract (for AI concept extraction)
filing_dateFiling date (for Momentum Index™ calculation)
assignee / Current AssigneePatent owner (for Openness Score™ competitive analysis)
cpc_codes / CPC ClassificationsTechnology classification (for Lifecycle Position™)

Auto-detection: Stratensight automatically detects your export format and maps columns. No manual configuration required.

Optional fields — improve score accuracy

Including these fields unlocks higher Intelligence Grade™ scores.

forward_citationsImproves Momentum Index™+15% accuracy
family_idEnables patent family deduplicationReduces noise
priority_dateImproves Lifecycle Position™ calculation+8% accuracy
inventorsEnables inventor network analysisNetwork signals
ipc_codesSecondary classification supportBroader coverage

File format

CSV

Recommended
  • UTF-8 encoding
  • Comma or semicolon separator
  • First row = headers

Excel (.xlsx)

Supported
  • Single sheet
  • Headers in row 1
  • Max 50 MB

JSON

Supported
  • Array of objects
  • Camel or snake_case keys
  • UTF-8 encoding

How to export from your database

LexisNexis PatentSight

  1. Select your patent set
  2. Export → Full Record → CSV
  3. Include: Title, Abstract, Filing Date, Current Assignee, CPC, Citations
  4. Max recommended: 10,000 patents per analysis

Questel Orbit Intelligence

  1. Execute your patent search in Orbit Intelligence
  2. Select results and click Export
  3. Choose CSV or Excel format
  4. Select fields: Publication Number, Title, Abstract, Applicant, Filing Date, CPC, Forward Citations, Family ID
  5. UTF-8 encoding recommended for abstracts
  6. Note: Orbit FAMPAT family grouping improves Intelligence Grade™ by enabling patent family deduplication

Derwent Innovation

  1. Select records → Export → Custom fields
  2. Include: PN, TTL, PAEE, Filing Date, Citations
  3. Format: CSV UTF-8

Espacenet

  1. Build query → Results → Download
  2. Select all available fields
  3. Note: citation data may be incomplete

Google Patents

  1. Use BigQuery export for large datasets
  2. Or manual download for < 1,000 patents

How your data quality affects Intelligence Grade™

Intelligence Grade™ gates the confidence of all other scores. Below 0.30, scores are flagged LOW CONFIDENCE.

DATA QUALITYINTELLIGENCE GRADE™IMPACT
LexisNexis / Questel Orbit full export85–99%Full analysis: all 4 scores and Decision Engine™
Partial fields (no citations)65–80%Core scores only
Generic CSV50–70%Basic analysis
< 100 patents40–60%Low confidence, directional only

GET STARTED

Ready to analyze your data?

Upload your patent export and get your first intelligence report in under 60 seconds.

Start your first analysis →