The following table describes attributes that can be found in most of the data containers.

Attribute

Description

ucid

Unique document identifier based on country, doc-number, and kind, e.g. US-96142365-A

Note: the doc-number may be an application or publication number. See application-reference and publication-reference.
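
As an illustration of the country, doc-number, and kind structure, a ucid can be split into its three parts with a few lines of Python. This is only a sketch: the names Ucid and parse_ucid are hypothetical and not part of CLAIMS Direct, and splitting on the first and last hyphen is an assumption made so that a doc-number containing hyphens would stay intact.

```python
from typing import NamedTuple


class Ucid(NamedTuple):
    """Hypothetical container for the three parts of a ucid."""
    country: str
    doc_number: str
    kind: str


def parse_ucid(ucid: str) -> Ucid:
    """Split a ucid such as 'US-96142365-A' into country, doc-number, kind."""
    # Assumption: the country precedes the first hyphen and the kind
    # follows the last hyphen; everything in between is the doc-number.
    country, first_sep, rest = ucid.partition("-")
    doc_number, last_sep, kind = rest.rpartition("-")
    if not (first_sep and last_sep and country and doc_number and kind):
        raise ValueError(f"not a country-number-kind identifier: {ucid!r}")
    return Ucid(country, doc_number, kind)


parts = parse_ucid("US-96142365-A")
print(parts.country, parts.doc_number, parts.kind)
```

Note that the parsed doc-number alone does not say whether it is an application or a publication number.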

mxw-id

Internal record-level identifier

load_source

Identifies the source of the data loaded into Alexandria. Values include: 

patent-office: Identifies documents published, usually weekly, by PTOs 
docdb: Weekly updates from the EPO product DocDB/Inpadoc 
google: English translations from Google's translation service
translated: Human-translated content 
mxw-smt: Data translated by Statistical Machine Translation (SMT) 
us-assign: Reassignment data from the USPTO 
inpadoc-ls: Legal status events from the EPO Inpadoc service 
ipcr and mcf: Reclassification files from EPO and USPTO, respectively
ifi: Value-added data from IFI CLAIMS processing

status

Internal attribute used in update procedures. Values include:

    • new
    • corrected
    • deleted

A deleted status can mean one of two things:

  1. The originating data provider (patent office or third party) has requested that we remove a document.
  2. The originating data provider (patent office or third party) has changed the makeup of the ucid either by remapping a kind code or changing the format of the publication number.

In both cases, CLAIMS Direct sets status=deleted and deleted_load_id=<load-id> and removes all data in all satellite tables, although deleted records remain indexed. Documents marked @status=deleted should never be extracted from your CLAIMS Direct instance. If a document is marked deleted after it has already been extracted, it should be removed from downstream processing.
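
The handling of deleted documents described above could be sketched in Python as follows. The record layout (dictionaries with "ucid" and "status" keys), the in-memory store, and the function name apply_updates are all assumptions made for illustration; they are not a CLAIMS Direct API.

```python
from typing import Dict, Iterable, Mapping


def apply_updates(store: Dict[str, Mapping], records: Iterable[Mapping]) -> Dict[str, Mapping]:
    """Apply extracted records to a downstream store keyed by ucid.

    Hypothetical helper: records with status 'deleted' are never added,
    and any copy extracted earlier is removed from the store.
    """
    for record in records:
        ucid = record["ucid"]
        if record.get("status") == "deleted":
            # Drop a previously extracted copy, if there is one.
            store.pop(ucid, None)
        else:
            # 'new' and 'corrected' records are inserted or overwritten.
            store[ucid] = record
    return store


downstream = {"US-96142365-A": {"ucid": "US-96142365-A", "status": "new"}}
updates = [
    {"ucid": "US-96142365-A", "status": "deleted"},
    {"ucid": "EP-1000000-A1", "status": "new"},
]
apply_updates(downstream, updates)
print(sorted(downstream))
```

The EP ucid in the usage example is made up for the sketch.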

ref-ucid

ucid of related document used for reference data, such as a PCT application

format

Designates the normalized or non-normalized format of the following document-id:

Format

Description

Example(s)

epo

Standardized name according to the DocDB file, all caps, no punctuation, limited to 30 characters.
For the names of individuals, the format is LAST_NAME FIRST_NAME MIDDLE_NAME/INITIAL.
See the EPO website for DocDB's list of standardized names.

1. IBM 
2. FINLAND TELECOM OY 
3. THE UNITED STATES GOVERNMENT
4. TUPPER ALAN WILLIAM

intermediate

Pre-standardized name, converted to all caps.
For the names of individuals, the format is LAST_NAME, FIRST_NAME MIDDLE_NAME/INITIAL.

1. INTERNATIONAL BUSINESS MACHINES
2. TELECOM FINLAND OY
3. DEPARTMENT OF THE NAVY
4. TUPPER, ALAN WILLIAM

original

Name as filed, provided directly from the publishing source (can be in non-Latin characters).
For the names of individuals, the format is Last_name, First_name Middle_name/initial.

1. International Business Machines Corporation 
2. Sonera Oyj 
3. The United States Government as represented by the Department of the Navy
4. Tupper, Alan William
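
The surface rules given for the epo format (all caps, no punctuation, limited to 30 characters) can be expressed as a simple check. The sketch below is illustrative only: the function name looks_like_epo_name is hypothetical, "no punctuation" is assumed to mean no ASCII punctuation characters, and passing the check does not mean a name actually appears in DocDB's list of standardized names.

```python
import string

MAX_EPO_NAME_LENGTH = 30  # limit stated in the epo format description


def looks_like_epo_name(name: str) -> bool:
    """Return True if `name` satisfies the stated epo surface rules."""
    if not name or len(name) > MAX_EPO_NAME_LENGTH:
        return False
    # All caps: upper-casing the name must not change it.
    if name != name.upper():
        return False
    # Assumption: "no punctuation" means no ASCII punctuation characters.
    return not any(ch in string.punctuation for ch in name)


for candidate in ("TUPPER ALAN WILLIAM", "TUPPER, ALAN WILLIAM"):
    print(candidate, looks_like_epo_name(candidate))
```

A check like this only separates the epo form from the intermediate and original forms where they differ in case, punctuation, or length.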
