Attribute

Description

ucid

Unique document identifier based on country, doc-number and kind

mxw-id

Internal record-level identifier

load_source

Identifies the source of the data loaded into Alexandria. It can have the following values:
patent-office: Documents published, usually weekly, by patent and trademark offices (PTOs)
docdb: Weekly updates from the EPO product DocDB/Inpadoc 
translated: Human-translated content 
mxw-smt: Data translated by Statistical Machine Translation (SMT) 
us-assign: Reassignment data from the USPTO 
inpadoc-ls: Legal status events from the EPO Inpadoc service 
ipcr and mcf: Reclassification files from EPO and USPTO, respectively

status

Internal attribute used in update procedures

format

Designates whether the following document-id is in a normalized or non-normalized format:

"epo" is the standardized DocDB format 
"original" is the unparsed format directly from the data source 
Rule

Format

Description

Example(s)

epo

Standardized name according to the DOCDB file, all caps, no punctuation, limited to 30 characters.
For the names of individuals, the format is LAST_NAME FIRST_NAME MIDDLE_NAME/INITIAL.

1. IBM
2. FINLAND TELECOM OY
3. THE UNITED STATES GOVERNMENT
4. TUPPER ALAN WILLIAM

intermediate

Pre-standardized name, converted to all caps.
For the names of individuals, the format is LAST_NAME, FIRST_NAME MIDDLE_NAME/INITIAL.

1. INTERNATIONAL BUSINESS MACHINES
2. TELECOM FINLAND OY
3. DEPARTMENT OF THE NAVY
4. TUPPER, ALAN WILLIAM

original

Name as filed, provided directly from the publishing source (can be in non-Latin characters).
For the names of individuals, the format is Last_name, First_name Middle_name/initial.

1. International Business Machines Corporation
2. Sonera Oyj
3. The United States Government as represented by the Department of the Navy
4. Tupper, Alan William
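
The surface rules given for the "epo" format (all caps, no punctuation, at most 30 characters) can be checked with a short Python sketch. This is only an illustration: the function names are hypothetical, replacing punctuation with a space is an assumption, and the real standardized names come from the DOCDB file, so a mapping such as "International Business Machines Corporation" to "IBM" cannot be reproduced by string handling alone.

```python
import re

# Length limit stated in the Rule table for the "epo" format.
EPO_MAX_LENGTH = 30


def to_epo_surface_form(name: str) -> str:
    """Apply only the surface rules of the "epo" format to a name.

    Hypothetical helper: it upper-cases the name, replaces punctuation
    with spaces (an assumption), collapses whitespace, and truncates to
    30 characters. It does not look up DOCDB standardized names.
    """
    upper = name.upper()
    # Replace anything that is not a letter, digit, underscore, or whitespace.
    no_punct = re.sub(r"[^\w\s]", " ", upper)
    collapsed = " ".join(no_punct.split())
    return collapsed[:EPO_MAX_LENGTH].rstrip()


def looks_like_epo_format(name: str) -> bool:
    """Return True if the name already satisfies the surface rules."""
    return name == to_epo_surface_form(name)
```

For example, `to_epo_surface_form("Tupper, Alan William")` yields the same string as the "epo" example for that individual, while `looks_like_epo_format("TUPPER, ALAN WILLIAM")` is false because of the comma.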

Note: The "original" format is provided only if the "epo" format is not available.
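
A consumer of these records might select a name by format preference, along the lines of the following Python sketch. The dictionary keyed by the format value is a hypothetical in-memory representation, not the Alexandria schema, and the preference order (epo, then intermediate, then original) is an assumption based on the rule above.

```python
from typing import Mapping, Optional

# Assumed preference order: most standardized format first.
FORMAT_PREFERENCE = ("epo", "intermediate", "original")


def pick_name(names_by_format: Mapping[str, str]) -> Optional[str]:
    """Return the name in the most standardized format that is present.

    names_by_format is a hypothetical mapping from the value of the
    format attribute to the name text; empty or missing entries are skipped.
    """
    for fmt in FORMAT_PREFERENCE:
        name = names_by_format.get(fmt)
        if name:
            return name
    return None


# A record with only the "original" format falls back to that name.
print(pick_name({"original": "Sonera Oyj"}))
# A record with several formats yields the "epo" name.
print(pick_name({"epo": "IBM", "intermediate": "INTERNATIONAL BUSINESS MACHINES"}))
```

Returning None rather than raising keeps the sketch usable for records that carry no name at all; callers that require a name can check for that case explicitly.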