Data generated by the TARGET initiative are accessible through a tabular, easy-to-use Data Matrix*. TARGET projects are ongoing; new data will be incorporated into the Data Matrix as they are generated, and the matrix version history will be updated accordingly. Users should note the version of the TARGET Data Matrix when accessing information.
* Please note that the TARGET Data Matrix schema, both content and format, is evolving over time to meet the needs of the research community. Comments, questions, and suggestions for improvements are encouraged and should be sent to firstname.lastname@example.org.
The tabular matrix links to as many as four levels of data, from raw data files to summarized findings.
Specifically included in the matrix are:
- names of diseases studied
- clinical information, including outcomes
- types of molecular data generated and platforms used
- metadata descriptions about each individual project
- multi-level chip-based and sequencing data links
To obtain specific datasets or metadata, including descriptions of each project, users can hover over the text within the table and click to access the appropriate files. The pilot TARGET projects, acute lymphoblastic leukemia (ALL) and neuroblastoma (NBL), are separated by phase: Phase I, the pilot portion of the initiative; and Phase II, expansion through ARRA funding. Note: NBL data are color-coded by phase.
Comprehensive TARGET Data Matrix user guide – coming soon!
Both open- and controlled-access tiers of TARGET datasets are available through the Data Matrix for each cancer studied. Obtaining controlled-access data requires approval and a user account. Specifically, any protected data stored at the TARGET DCC (all chip-based and upper-level sequencing data files) can be accessed with an NCI user account, while lower-level trace sequencing files and aligned reads from next-generation sequencing (BAM files) require an eRA Commons account to download. Each is depicted separately within the TARGET Data Matrix. Visit the Data Access page for more information.
Access tiers indicated on the matrix:
- Blue = open access
- Red = controlled access (NCI & NCBI)
- Black = unavailable
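For users who record matrix entries in their own scripts, the color legend and account requirements described above can be sketched as a small lookup. This is only an illustrative sketch, not part of any TARGET tool: the function `required_account` and the data-kind labels (`chip_based`, `upper_level_sequencing`, `trace_sequence`, `bam`) are hypothetical names chosen here, and the approval step itself is not modeled.

```python
from enum import Enum


class AccessTier(Enum):
    OPEN = "open access"
    CONTROLLED = "controlled access"
    UNAVAILABLE = "unavailable"


# Color legend as stated for the Data Matrix.
COLOR_TO_TIER = {
    "blue": AccessTier.OPEN,
    "red": AccessTier.CONTROLLED,
    "black": AccessTier.UNAVAILABLE,
}

# Hypothetical labels for the kinds of controlled data named in the text.
NCI_ACCOUNT_KINDS = {"chip_based", "upper_level_sequencing"}
ERA_COMMONS_KINDS = {"trace_sequence", "bam"}


def required_account(color, data_kind):
    """Return the account a user would need, or None for open-access data.

    Raises ValueError for unavailable data or an unrecognized data kind.
    """
    tier = COLOR_TO_TIER[color.lower()]
    if tier is AccessTier.OPEN:
        return None
    if tier is AccessTier.UNAVAILABLE:
        raise ValueError("dataset is marked unavailable")
    # Controlled access: the account depends on the kind of file.
    if data_kind in NCI_ACCOUNT_KINDS:
        return "NCI user account"
    if data_kind in ERA_COMMONS_KINDS:
        return "eRA Commons account"
    raise ValueError(f"unrecognized data kind: {data_kind!r}")
```

For example, `required_account("red", "bam")` returns the eRA Commons requirement, while any blue entry returns `None` because no account is needed.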
A variety of open-source tools can be used to visualize, explore, and integrate TARGET data. Learn more about some of the tools available to use with NCI large-scale genomics data.
Performing Bulk Downloads
TARGET datasets are not currently available for bulk download through the TARGET Data Matrix; however, this feature will be available in the future.