Genome Track Analyzer or AnCorr is the set of tools for genome-wide study of genomic features and their correlations.

The broad class of genetic and epigenetic tasks can be reduced to the study of features distributed over the genome (genome tracks).
Our server contains the following tools applicable to genomic track investigations:

Methods and software based upon the analytical criteria enable rapid and efficient processing of huge amount of data stored in genome-scale datasets.
We have developed criteria for assessment of genome track inhomogeneity and of correlations between two genome tracks.
The full description of the criteria and their implementations can be found in the following article.

Kravatsky Yuri V., Chechetkin Vladimir R., Tchurikov Nikolai A., Kravatskaya Galina I.
Genome-wide study of correlations between genomic features and their relationship with the regulation of gene expression.
DNA Research 2015, vol.22, issue 1, pp. 109-119, DOI: 10.1093/dnares/dsu044, PMID: 25627242

Attention! Genome Track Anazyler commandline version is available at request!


This work is supported by RFBR grants no 14-04-01638 A, 17-04-02152 A

Warning: all methods and criteria implemented at this server are statistics-based.
The number of pairs/points should be large enough. We chose the threshold of 50 pairs/points.
For lower number of pairs/points, some parts of the results are displayed in red,
denoting an insufficient number of points for statistics.
The following example shows such a correlation table row:

Example of insufficient pairs
Chromosome z Correlation p z p Ordering p z Pairs1st set2nd set
chrY-1.2520.21047-1.6980.0895617333270

General layout of methods and criteria

Note that AnCorr can find both positive (correlation) and negative (anticorrelation) correlations.

AnCorr calculates the ordering information (zp) about datasets too.
Positive statistically significant value means that the points/streches in dataset 1 precede points/stretches in dataset 2.
In the resulting table it is displayed as "1⇒2". Negative statistically significant value means the reverse situation and is displayed as "2⇒1".
Output for all methods and criteria uses a common color layout.

Thresholds, colors and p-values
z valuesMeaning
|z|<1.8Insignificant, no correlation
1.8≤|z|<1.96Fuzzy correlation (fuzzy in results)
1.96≤|z|<2.58Significant correlation, p-value<0.05
|z|≥2.58Higly significant correlation (strong in results), p-value<0.01

Correlations between point-wise and stretch-wise genomic tracks

This tool compares genome tracks by algorithm robust to inhomogeneity of genome tracks. Comparison of tracks is perfomed with their attribution relative to DNA strands.

    How to compare/calculate correlations between genome tracks using this tool:
  1. Set up files for Dataset 1 and Dataset 2. AnCorr can upload files only from your local computer, no URL treatment.
  2. Set up file formats for both datasets properly. AnCorr cannot autodetect file formats.
  3. Set the strand for correlation calculation. By default strand information is ignored
    (both strands are taken into consideration)
  4. For BED/BED6/BED12/GFF formats you can set correlation point too.
    By default it is the midpoint of stretch/segment, you can change it to
    the start of each stretch/segment or to the end of each stretch/segment.
  5. You can select type of processing too. By default AnCorr process tracks chromosome-wide.
    You can change type to genome-wide calculation. When you select this type of processing then all
    data each of two tracks is joined without gaps with respect to minimum/maximum data coordinates
    for each chromosome at both tracks.
  6. Results are displayed as color-coded table.
    For each chromosome you will obtain the result in the one of the following terms:
    empty cell (no significant correlation), in the other cases the color of cell will indicate
    type (positive/negative) of correlation and its strength.
    To copy results to the MS Excel/Libre Calc, press "Select Table" button then use Copy/Paste feature
    of your operating system. If you want to copy/paste resulting table with color output you can use
    either of browsers: Google Chrome, Internet Explorer, Safari, Opera. Firefox or Palemoon cannot
    copy/paste coloured table, even with plugins. It is known very old bug. You can paste coloured table
    only to Microsoft Office products. Libre Office Calc doesn't support coloured tables pasting.

Input formats are:

SGR/TXT can be used in one of the following notations:
Text format without strand information:
chr23 49302345
Text format with strand information:
chr23 49302345 +

BED/bedGraph can be used in one of the following notations:
BED without strand information (BED3):
chr23 49302345 49305671
BED with strand information:
chr23 49302345 49305671 -

BED6/BED12 full-featured BED format:
chr7 127474697 127475864 Pos4 0 +

GFF/GFF2/GFF3 full-featured GFF formats:
chr2R CisGenome protein_binding_site 8972000 8975706 196.39 + ID=region_299

By default:

AnCorr doesn't support file format autodetection, so please set correct file types!


Correlations between point-wise and stretch-wise genomic tracks and expression profiles

This tools is designed to compare profiles and/or genome tracks by robust method with their attribution relative to DNA strands.
Using this tool, you can compare/correlate genomic tracks and DNA expression/binding profiles in any combination.

    How to compare/calculate correlations between genome tracks using this tool:
  1. Set up type of data for each file. It can be "Profile" or "Track". The server displays the set of options available for the current data type only.
  2. You should choose the correct peak caller depending on your data. If you use pre-aligned sequenced profile data you should choose
    MACS peak caller. MACS can process your data in CHiP-Seq specific or in general (--nomodel) mode. If you use preprocessed profile
    data in formats like SGR/TXT or non-overlapped BED/WIG you should choose bult-in AnCorr peak caller. The further instructions
    will significanly depend on the chosen peak caller.
  3. You can select type of calculations too. By default AnCorr process tracks chromosome wide.
    You can change type to genome wide calculation. When you select this type of calculation all
    data each of two tracks is joined without gaps with respect to minimum/maximum data coordinates
    for each chromosome at both tracks.
  4. Results are displayed as color-coded table.
    For each chromosome you will obtain the result in the one of the following terms:
    empty cell (no significant correlation), in the other cases the color of cell will indicate
    type (positive/negative) of correlation and its strength.
    To copy results to the MS Excel/Libre Calc, press "Select Table" button then use Copy/Paste feature
    of your operating system. If you want to copy/paste resulting table with color output you can use
    either of browsers: Google Chrome, Internet Explorer, Safari, Opera. Firefox or Palemoon cannot
    copy/paste coloured table, even with plugins. It is known very old bug. You can paste coloured table
    only to Microsoft Office products. Libre Office Calc doesn't support coloured tables pasting.

Input formats for genomic tracks/stretches are:
SGR/TXT can be used in one of the following notations:
Text format without strand information:
chr23 49302345
Text format with strand information:
chr23 49302345 +

BED/bedGraph can be used in one of the following notations:
BED without strand information (BED3):
chr23 49302345 49305671
BED with strand information:
chr23 49302345 49305671 -

BED6/BED12 full-featured BED format:
chr7 127474697 127475864 Pos4 0 +

GFF/GFF2/GFF3 full-featured GFF formats:
chr2R CisGenome protein_binding_site 8972000 8975706 196.39 + ID=region_299

Input formats for expression and/or DNA-protein binding profiles are:
SGR/TXT The same as for points above, for profiles the strand field is ignored even if it is presented.
BED/bedGraph The same as for stretches above, for profiles it should be bedGraph with no strand field.
WIG full-featured Wiggle track format
bigWig binary condensed version of WIG, description here: bigWig track format

By default for genome tracks:

By default for expression and/or DNA profiles:

This tool transparently supports gzipped files. You can upload .sgr.gz, .bed.gz. and .wig.gz files to AnCorr.
AnCorr doesn't support file format autodetection, so please set correct file types!


Assessment of genome track inhomogeneity

When studying correlations between genome tracks, it is useful to begin with the preliminary assessment of input data.
In our package the (in)homogeneity of length distribution related to the genome tracks is assessed by two methods.