Skip to content

TriNetX

TriNetX is a web-based tool for research population cohort and feasibility queries that also enables researchers to collaborate with peers at other member institutions.

Why use TriNetX?

Through TriNetX and i2b2, users search for patients meeting specified criteria in a de-identified database, without prior Institutional Review Board (IRB) approval. Data are presented as unique patient counts, and a patient is counted only once. Data in TriNetX also exclude patients with only a medical record number or without diagnoses or codes. Such a search can help researchers determine whether enough potential patients are available to properly conduct a research study. With IRB approval and an enterprise information management request, patient-level data can be requested.

TriNetX also offers chart and graph options for data visualization and includes a rate-of-arrival algorithm. This algorithm determines how many patients matching certain criteria visited Penn State Health within the past three years, and then predicts how many potential visits will happen each quarter over the next year. A Trial Connect feature allows clinical research organizations and industry sponsors to determine and connect with potential study sites.

TriNetX is currently being used by nearly 30 of Penn State’s peers in the Clinical and Translational Science Award Program. It combines data with a global health research network, enabling health care organizations, pharmaceutical companies and contract research organizations to collaborate, enhance trial design, accelerate recruitment and bring new therapies to market faster.

Working with TriNetX

Request access, training and support for TriNetX and other research population tools

Jump to topic

Search

Learn More

Code Lookups Expand answer

TriNetX uses International Classification of Diseases (ICD) codes, Logical Observation Identifiers Names and Codes (LOINC), and Current Procedural Terminology (CPT) codes in its searches.

The following links provide lookup tables for these codes:

Data Refresh Schedule Expand answer

The TriNetX tentative data refresh schedule is listed here. Please note that the application will be unavailable between 8 p.m. Tuesday and 5 p.m. Wednesday on the dates listed unless otherwise noted. Please see the change log section of this page for the current state of the data.

Standard Data Set Expand answer
Change Log Expand answer
Support Expand answer


For general technical help, or to report an issue with Clinical and Translational Science Institute applications including i2b2, call 833-577-4357 or email help@pennstatehealth.psu.edu.

For account help, go to Research Population Portal.

For help with queries, please complete a service request form.

TriNetX Research Network

Penn State researchers have access to additional data outside the Penn State Health electronic medical record through TriNetX Research Network.

TriNetX Research datasets provide researchers access to de-identified patient data from a network of health care organizations.

What kinds of data come in a TriNetX dataset? Expand answer

TriNetX datasets include clinical patient data such as demographics, diagnoses, procedures, labs and medications – commonly referred to as real-world data.

The data in TriNetX datasets are:

  • Primarily from healthcare organizations electronic medical record (EMR) systems
  • Collected for the primary purpose of providing care to patients

The data in TriNetX datasets are not:

  • Claims data, data primarily collected for billing
  • Data collected for randomized clinical trials
Where do the data in a TriNetX dataset originate? Expand answer

Data in TriNetX datasets comes from health care organizations and other data providers. The data these entities provide primarily come from:

  • EMR systems
    • Structured data
    • Unstructured data processed by Natural Language Processing technology
  • Cancer registries
  • Other sources like genomic data from third-party genomic testing labs
What are the characteristics of the health care organizations that provide TriNetX with data? Expand answer

The majority of the health care organizations are large academic medical institutions with both inpatient and outpatient facilities. Most of these are adult acute-care hospitals with multiple facilities and locations. All are currently located within the United States and provide TriNetX with both inpatient and outpatient data. The data they provide represents the entire patient population at the health care organization. Most provide an average of seven years of historical data.

How are data transformed from the source? Expand answer

TriNetX typically receives data from health care organizations and other data providers in one of two ways:

  • TriNetX receives data directly from a health care organization research repository into the TriNetX environment.
  • A health care organization or data provider sends TriNetX data extracts in the form of CSV files TriNetX Data Dictionary.

TriNetX maps the data to a standard and controlled set of clinical terminologies and transforms it into a proprietary data model. This transformation process includes extensive data quality assessment that includes data cleaning that rejects records that don’t meet the TriNetX quality standards.

How current are the data? Expand answer

One of the distinguishing characteristics of the TriNetX dataset is that it is continuously refreshed. Health care organizations and other data providers update their data at various times, with over 80 percent refreshing in one-, two or four-week frequency intervals. The average lag time for a health care organization’s source data refresh is one month.