You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

 

Field-by-field

Common attributes

The following table describes some attributes that can be found in most of the data containers.

Attribute

Description

ucid

Unique document identifier based on country, doc-number and kind

mxw-id

Internal record-level identifier

load_source

Identifies the source of the data loaded into Alexandria and can have the following values:
patent-office: It identifies documents published, usually weekly, by PTOs
docdb: Weekly updates from the EPO product docdb/inpadoc
translated: Human translated content
mxw-smt: Data translated by Statistical Machine Translation (SMT)
us-assign: Reassignment data from the USPTO
inpadoc-ls.: Legal status events from the EPO Inpadoc service
ipcr and mcf: Reclassification files from EPO and USPTO, respectively

status

Internal attribute used in update procedures

format

Designates the normalized or not-normalized format of the following document-id.
"epo" is the standardized DOCDB format
"original" is the unparsed format directly from the data source

Rule: "original" format is only provided if "epo" format is not available

Element

INID

Description

country

 

The country is represented by a standard 2-character code based on WIPO ST.3. In the case of WO publications, the country is the authority where the application was filed

doc-number

21

doc-number is provided in DOCDB-normalized format.  DOCDB-normalized format usually includes the filing year (2 or 4 digits) at the beginning or end unless the year is already embedded in the application number.

  • US application number does not include series code, but 2-digit year is appended to serial number, for example: 22695699 or 47735700
  • WO application numbers are formatted with 2-digit year followed by 5-digit serial number prior to 2004-01-01. Beginning with application dates in 2004, the format is 4-digit year followed by 6-digit serial number, e.g.  9813495 or 2004000001. Kind code of 'W' must be used to distinguish WO applications from national office applications.
  • JP application numbers include a 2-digit year as suffix until 2000; thereafter the 4-digit year is prepended, e.g. 131699 or 2007000849.
    For a complete discussion of application numbers over time and over multiple authorities, please see Number format concordance - application/priority numbers at
    http://www.epo.org/searching/essentials/data/tables.html |

kind

23

Kind code of applications comes from DOCDB (A=patent; U=utility; P=provision; W= PCT; F=design; T=translation)

date

22

Date of filing (YYYYMMDD)

lang

 

Filing language, 2-character code based on ISO 639-2. Default value is "XX"

  • No labels