Comment: Reverted from v. 9


...

Field-by-field

...

Common attributes

The following table describes some attributes that can be found in most of the data containers.

| Attribute | Description |
| --- | --- |
| ucid | Unique document identifier based on country, doc-number and kind |
| mxw-id | Internal record-level identifier |
| load_source | Identifies the source of the data loaded into Alexandria and can have the following values: patent-office: documents published, usually weekly, by PTOs; docdb: weekly updates from the EPO product docdb/inpadoc; translated: human-translated content; mxw-smt: data translated by Statistical Machine Translation (SMT); us-assign: reassignment data from the USPTO; inpadoc-ls.: legal status events from the EPO Inpadoc service; ipcr and mcf: reclassification files from EPO and USPTO, respectively |
| status | Internal attribute used in update procedures |

...

format

...

| Element | INID | Description |
| --- | --- | --- |
| country | | The country is represented by a standard 2-character code based on WIPO ST.3. For WO publications, the country is the authority where the application was filed. |
| doc-number | 21 | doc-number is provided in DOCDB-normalized format, which usually includes the filing year (2 or 4 digits) at the beginning or end unless the year is already embedded in the application number. US: the application number does not include the series code, but a 2-digit year is appended to the serial number, e.g. 22695699 or 47735700. WO: application numbers filed before 2004-01-01 consist of a 2-digit year followed by a 5-digit serial number; beginning with application dates in 2004, the format is a 4-digit year followed by a 6-digit serial number, e.g. 9813495 or 2004000001. A kind code of 'W' must be used to distinguish WO applications from national office applications. JP: application numbers include a 2-digit year as a suffix until 2000; thereafter the 4-digit year is prepended, e.g. 131699 or 2007000849. For a complete discussion of application numbers over time and across authorities, see "Number format concordance - application/priority numbers" at http://www.epo.org/searching/essentials/data/tables.html |
| kind | 23 | Kind code of the application, taken from DOCDB (A=patent; U=utility model; P=provisional; W=PCT; F=design; T=translation) |
| date | 22 | Date of filing (YYYYMMDD) |
| lang | | Filing language, 2-character code based on ISO 639-1. Default value is "XX" |
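
The doc-number rules for US, WO and JP application numbers can be sketched in Python as a small splitter that separates the embedded year from the serial digits. This is only an illustration under stated assumptions: the function name `split_app_number` and the `AppNumber` type are hypothetical, the US and JP rules are keyed on the `country` value, and the JP format is chosen by length (10 digits is taken to mean a prepended 4-digit year), which is inferred from the examples rather than stated as a rule. Other authorities are not handled; the EPO concordance tables remain the reference.

```python
from typing import NamedTuple, Optional


class AppNumber(NamedTuple):
    """Hypothetical helper type, not part of the CLAIMS XML format."""
    year: str    # year digits exactly as embedded (2 or 4 characters)
    serial: str  # remaining serial digits, leading zeros preserved


def split_app_number(country: str, doc_number: str, kind: str = "A") -> Optional[AppNumber]:
    """Split a DOCDB-normalized application number into year and serial.

    Returns None when the number does not match a rule covered here.
    """
    if not (doc_number.isascii() and doc_number.isdigit()):
        return None

    # PCT applications are identified by kind 'W', whatever the country value.
    if kind == "W":
        if len(doc_number) == 7:    # before 2004: YY + 5-digit serial
            return AppNumber(doc_number[:2], doc_number[2:])
        if len(doc_number) == 10:   # from 2004: YYYY + 6-digit serial
            return AppNumber(doc_number[:4], doc_number[4:])
        return None

    # US: serial number with the 2-digit year appended.
    if country == "US" and len(doc_number) >= 3:
        return AppNumber(doc_number[-2:], doc_number[:-2])

    if country == "JP":
        # Assumption: 10 digits means the newer YYYY-prefixed format.
        if len(doc_number) == 10:
            return AppNumber(doc_number[:4], doc_number[4:])
        if len(doc_number) >= 3:    # older format: 2-digit year as suffix
            return AppNumber(doc_number[-2:], doc_number[:-2])

    return None


print(split_app_number("WO", "9813495", "W"))   # AppNumber(year='98', serial='13495')
print(split_app_number("US", "22695699"))       # AppNumber(year='99', serial='226956')
print(split_app_number("JP", "2007000849"))     # AppNumber(year='2007', serial='000849')
```

Two-digit years are returned as they appear because the century cannot be determined from the number alone; the filing date element would be needed to resolve it.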

...

The CLAIMS XML format uses the UTF-8 character set and is based on WIPO Standard ST.36, which we have extended to include our value-added IFI Snapshot in the ifi-integrated-content container. The following XML content description describes each textual data unit in terms of its elements and attributes. It includes details about INID codes, expanded descriptions of the contained data, and examples. The Common XML Attributes and Common XML Elements sections cover attributes and elements that appear in many of the data containers. The Field-by-Field section describes each container.
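
As a rough illustration of reading such element-based content, the sketch below parses a small fragment with Python's standard `xml.etree.ElementTree`. The fragment is hypothetical: the wrapping `document-id` element, the placement of `lang` as a child element, and the sample values (including the date) are assumptions made for the example, not an excerpt from real CLAIMS data. The function name `read_document_id` is likewise invented here.

```python
import xml.etree.ElementTree as ET
from datetime import date, datetime

# Hypothetical fragment; element nesting and values are illustrative only.
SAMPLE = """<document-id>
  <country>US</country>
  <doc-number>22695699</doc-number>
  <kind>A</kind>
  <date>19990107</date>
  <lang>EN</lang>
</document-id>"""


def read_document_id(xml_text: str) -> dict:
    """Read country, doc-number, kind, date and lang from one fragment."""
    node = ET.fromstring(xml_text)
    raw_date = node.findtext("date")
    return {
        "country": node.findtext("country"),
        "doc_number": node.findtext("doc-number"),
        "kind": node.findtext("kind"),
        # Dates are given as YYYYMMDD.
        "date": datetime.strptime(raw_date, "%Y%m%d").date() if raw_date else None,
        # "XX" is the documented default when no language is present.
        "lang": node.findtext("lang", default="XX"),
    }


record = read_document_id(SAMPLE)
print(record["country"], record["doc_number"], record["date"].isoformat(), record["lang"])
# US 22695699 1999-01-07 EN
```

When reading real files from disk, passing the raw bytes to the parser lets it honor the UTF-8 encoding directly.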
