...

asrch is a command-line tool for searching an optional on-site Solr installation and extracting data in either Solr response format or complete CLAIMS Direct XML. It is installed as part of the CLAIMS Direct repository. See the Client Tools Installation Instructions for details on installing this tool.

asrch [Options ...] query
  --url=s       search URL (excluding /select)
                  (default=http://solr.alexandria.com:8080/alexandria-index/alexandria)
  --raw         output raw Solr XML
  --count       output total documents found
  --maxrows=i   maximum documents to output
                  this argument is ignored when using --table
  --output=file specify output file
  --dtdpublic=pi  Public Identifier for DTD
  --dtdsystem=si  System Identifier for DTD
  Output Options
  --------
  --archive     archive result set documents into predictable path
                directory structure (Alexandria XML only)
  --archiveroot=dir
                root directory to place result set (default=.)
  --wrapper=s   wrap multiple documents in wrapper-named element
                default=patent-documents
  --pretty      indent output
  SOLR Options
  --------
  --solropt=s@  Solr options.
    e.g., --solropt=sort=f1,f2,f3 --solropt=rows=30
    See: http://wiki.apache.org/solr/CommonQueryParameters
  DWH Options
  --------
  --pgdbname     as defined in /etc/alexandria.xml (default=alexandria)
  --dbfunc       extract UDF (default=xml.f_patent_document_s)
  --table=s      If specified, a table of UCIDs/publication_ids is
                 created -- could later be used for indexing
    --truncate  truncate --table if it currently exists
  --help         print this usage and exit

...

| Parameter | Description |
|---|---|
| pgdbname | The database entry, as configured in /etc/alexandria.xml, that points to the on-site CLAIMS Direct PostgreSQL instance. The default value is alexandria, which is pre-configured in /etc/alexandria.xml. |
| url | The URL of the CLAIMS Direct Solr instance. |

Output Options

The following parameters specify output possibilities.

| Parameter | Description |
|---|---|
| output | Write results to the named file. By default, output goes to stdout. |
| archive | Archive results in a predictable path structure. See aext. |
| archiveroot | The root directory of the archive. See aext. |
| wrapper | Name of the top-level XML element used to wrap multiple documents. The default is patent-documents. |
| pretty | Indent the output XML. |
| count | Output only the count of documents. |
| maxrows | Maximum number of documents to output. This parameter is ignored when --table is used. |
| table | If specified, a table of UCIDs/publication_ids is created. |
| raw | Output the raw Solr response XML. |
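
As a rough illustration of how the output parameters combine, the sketch below assembles an invocation that limits output to ten documents, indents the XML, and writes it to a file. The URL placeholder, the query, and the file name results.xml are assumptions chosen for illustration. The script runs asrch only if it is found on the PATH; otherwise it prints the command it would have run.

```shell
#!/bin/sh
# Sketch only: SOLR_URL, QUERY, and OUT are placeholder values,
# not defaults of asrch.
SOLR_URL="http://SOLR-INSTANCE-URL/alexandria-v2.1/alexandria"
QUERY='loadid:261358'
OUT="results.xml"

# Collect the arguments as positional parameters so quoting is preserved.
set -- --url="$SOLR_URL" --maxrows=10 --pretty --output="$OUT" "$QUERY"

if command -v asrch >/dev/null 2>&1; then
  asrch "$@"
else
  # asrch is not installed here; show the command instead of running it.
  echo "asrch $*"
fi
```

Swapping --output for --archive and --archiveroot would follow the same pattern.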

...

Solr Options

| Parameter | Description |
|---|---|
| solropt | Raw Solr query parameters. This parameter can be used multiple times, e.g., --solropt='sort=pd desc' --solropt='fq=pnctry:us' |
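
Because --solropt can be repeated, a script can generate one flag per Solr parameter. The following is a minimal sketch in POSIX sh: it starts from a list of key=value pairs and rewrites each one into a --solropt argument. The query loadid:261358 is borrowed from the examples on this page, and the loop is one possible approach rather than anything asrch requires.

```shell
#!/bin/sh
# Start with the plain Solr parameters as positional parameters.
set -- 'sort=pd desc' 'fq=pnctry:us'

# Rewrite the list in place: append the flag form of each parameter,
# then drop the original from the front of the list.
for opt in "$@"; do
  set -- "$@" "--solropt=$opt"
  shift
done

# Append the query itself (illustrative value).
set -- "$@" 'loadid:261358'

if command -v asrch >/dev/null 2>&1; then
  asrch "$@"
else
  # asrch is not installed here; print each argument quoted instead.
  printf 'asrch'
  printf " '%s'" "$@"
  printf '\n'
fi
```

Keeping each parameter as its own argument means values that contain spaces, such as sort=pd desc, reach asrch intact.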

...

asrch  --count \
       --url=http://SOLR-INSTANCE-URL/alexandria-v2.1/alexandria \
'loadid:261358'
-> executing search ...  (found 4613; done in 0.095)
4613
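
When the count is needed inside a script, it can be picked out of output like that shown above. This is a minimal sketch, assuming the progress line beginning with -> may be mixed into the captured text and that the bare count is always the last line. The sample text is copied from the example rather than produced by a live run.

```shell
#!/bin/sh
# Sample text copied from the example above; in real use this would be
# the captured output of an asrch --count invocation.
output='-> executing search ...  (found 4613; done in 0.095)
4613'

# Keep only the last line, which holds the bare count.
count=$(printf '%s\n' "$output" | tail -n 1)

# Guard against unexpected output before treating the value as a number.
case "$count" in
  ''|*[!0-9]*) echo "unexpected output: $count" >&2; exit 1 ;;
esac

echo "documents found: $count"
```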

Output Select Fields in Solr XML

The following example searches Solr and returns the results in XML format.

Info

You can return Solr results in a variety of formats using the query parameter wt. For a detailed list of output format options, see https://cwiki.apache.org/confluence/display/solr/Response+Writers.
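
As a hedged illustration of the note above: wt is an ordinary Solr query parameter, so it could presumably be supplied through --solropt together with --raw. Whether asrch forwards wt unchanged is an assumption here and should be verified against your installation. The URL is a placeholder, and json is just one of the response writers Solr may offer.

```shell
#!/bin/sh
# Assumption: asrch passes wt through to Solr unchanged when --raw is set.
SOLR_URL="http://SOLR-INSTANCE-URL/alexandria-v2.1/alexandria"
WT="json"   # any response writer name supported by the Solr instance

set -- --raw --url="$SOLR_URL" --solropt="wt=$WT" --solropt='rows=5' 'loadid:261358'

if command -v asrch >/dev/null 2>&1; then
  asrch "$@"
else
  # asrch is not installed here; show the command instead of running it.
  echo "asrch $*"
fi
```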

...