The PostgreSQL component is the heart of CLAIMS Direct. It contains the XML for the entire data warehouse collection, processes updates from the primary, and functions as data source for the optional SOLR index.

Hardware Requirements test

Requirement	Recommended
CPU	4-cores
System Memory	24GB
Storage Capacity	4TB (SSD preferred)

Software Requirements

Requirement

Minimum Version

Notes

Operating System RHEL 7, Fedora 25-27, CentOS 7, Amazon Linux AMI If you are using Fedora, you must use version 27 or below. Installing Fedora v28 and above will install a version of PostgreSQL that is incompatible with CLAIMS Direct.

PostgreSQL

9.4

yum -y install /

    postgresql postgresql-contrib \
    postgresql-odbc postgresql-plperl \
    postgresql-server

Note: Versions 10 and above are incompatible with CLAIMS Direct.

IFI CLAIMS Repository

RHEL/CentOS 7
yum -y install \ 
 https://repo.ificlaims.com/ifi-claims-direct/centos/7/x86_64/ifi-claims-direct-beta-0.0.2-1.el7.centos.x86_64.rpm

Note: CentOS 7 needs an additional repository: yum -y install epel-release

Amazon Linux
yum -y install \
 https://repo.ificlaims.com/ifi-claims-direct/amazon/x86_64/ifi-claims-direct-beta-0.0.2-1.amzn1.x86_64.rpm

Fedora
yum -y install \
 https://repo.ificlaims.com/ifi-claims-direct/fedora/27/x86_64/ifi-claims-direct-beta-0.0.2-1.fc27.x86_64.rpm

libxml2

Some CLAIMS Direct loading and maintenance code utilizes the PostgreSQL perl extension (plperl) as well as a heavy reliance on the libxml2 XML parsing library. IFI CLAIMS has produced a patched release of libxml2 as an RPM. It is highly recommended to update libxml2 from the IFI CLAIMS software repository. For additional distributions, please contact support@ificlaims.com.

PostgreSQL Installation

Regardless of installation type, careful planning of disk resources is important for efficient data loading into and extraction out of PostgreSQL. There are 6 logical segments inside the CLAIMS Direct data warehouse.

Segment	Description	Size
work index	All indices pertaining to loading	30GB (variable)
work text	All raw table data queued for loading	100GB (variable)
xml index	All permanent indices for the data warehouse	400GB
xml text	All permanent text for the data warehouse	1TB
pg data	The cluster metadata, reporting, and logging directory	5GB (variable)
pg xlog	Log shipping for replication	50GB (variable)

Each of these segments can be allocated discrete disk space through the use of TABLESPACES. Although not required, the use of TABLESPACES will improve loading and extraction performance. The total logical size of the data warehouse is approximately 2TB after initial loading.

PostgreSQL Initial Configuration

CLAIMS Direct requires a working PostgreSQL cluster. If you do not have an initialized cluster, the following steps will initialize the cluster and give you rudimentary authentication and access levels needed to run CLAIMS Direct.

su - postgres

# -D, --pgdata: specify location of pg data segment

# -X, --xlogdir: specify location of pg xlog segment

initdb -A trust \

-D /var/lib/pgsql/data \

-E utf8 \

-X /pgxlog

The following configuration files are to be edited:

/var/lib/pgsql/data/pg_hba.conf

# Allow trusted local access

local all all trust

# IPv4 local connections

host all all 127.0.0.1/32 trust

# Other hosts on subnets that may require access, for example

# host all all 192.168.10.0/24 trust

/var/lib/pgsql/data/postgresql.conf

listen_addresses = '*' # what IP address(es) to listen on;

If you already have an initialized cluster, please be certain that local access is enabled for stand-alone installation. In either distributed install, if a separate services machine is created, its IP address needs access. This is imperative for the client update procedures.

Finally, enable and start the PostgreSQL cluster:

# on systemd based systems:

systemctl enable postgresql.service

systemctl start postgresql.service

CLAIMS Direct PostgreSQL Role

CLAIMS Direct requires the role alexandria:

psql -Upostgres postgres
=#> create role alexandria with superuser login;

Creating the Database

The PostgreSQL data warehouse portion of CLAIMS Direct is delivered in two parts:

PostgreSQL database schema (alexandria-dwh.sql)
<table>.gz files located in the sub-directory data

# create the database
cat alexandria-dwh.sql | psql -Ualexandria postgres

Tuning the Data Warehouse

pgtune (optional)

Although specifying exact postgresql.conf configuration parameters may seem beneficial, in reality, every installation is different. There are many factors in tuning your cluster including system memory, resource contention from other services running on the server, available disk space, disk types et al. The primary cluster supporting your updates was tuned using the tool pgtune. It is recommended to run the tool and apply the changes before bulk loading the data. Note that this requires Python.

yum -y install pgtune
  pgtune -i postgresql.conf  -T DW -c 500
  [ paste added configuration options at end of postgres.conf ]

Redirecting errors to LOG.2 (optional)

If desired, you can redirect errors (if any) to LOG.2:

cat alexandria-dwh.sql | psql -Ualexandria postgres 2>LOG.2

Pre-flight check

Use the pre-flight check to ensure the database and operating system are properly configured before loading the tables:

./pre-flight-check.sh

The sample output of a properly configured system would look like this:

# Testing localhost/alexandria ...
#   OK    : procedual language sql
#   OK    : procedual language plpgsql
#   OK    : procedual language plperl
#   OK    : procedual language plperlu
#   OK    : XML capability (test 1/libxml): 
#   OK    : XML capability (test 2/libxml):

Loading the Tables

The process of loading CLAIMS Direct data into PostgreSQL tables will take 1-2 days on most hardware configurations.

The following code block outlines the loading of the CLAIMS Direct data into PostgreSQL tables:

# for each table, load ...
gunzip -c data/<table>.gz | psql -Ualexandria alexandria
 
# sample batch script
for tbldata in $(ls data/*.gz); do
  gunzip -c $tbldata | psql -Ualexandria alexandria
done

Next Steps

Once the data has been loaded, proceed to:

installing the Alexandria client tools OR
setting up the update process

tblspc_wrkidx	work index	/wrkidx/_pg	30GB
tblspc_wrktxt	work text	/wrktxt/_pg	100GB
tblspc_xmlidx	xml index	/xmlidx/_pg	400GB
tblspc_xmltxt	xml text	/xmltxt/_pg	2000GB

Page tree