-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Standard for expected v1.0 #217
Comments
we could "safely" register *maybe check compatibility with cooler ? here is a more detailed example - with several schemas for PS |
Nice, some good ideas there @sergpolly! I think providing |
seems there a few other properties in the above that weren't formalized that would be useful for a
|
We renamed |
ah yeah, I'll add that update to the 0.5.0 roadmap |
diag -> sep -> dist ... |
@sergpolly does recent commit close this issue? or should we convert to discussion? or add some todo for adding these as schemas (cc @nvictus ) |
oh - a couple todos:
everyhting else was addressed by the recent PR again will do as a PR with small fixes for expected |
done in #296 |
The goal will be to make all tools conformant to these formats and conventions.
Formats
The following formats are tab-separated when stored as text and must include a header line for column names. The
diag
columns are treated as "dense", meaning all submatrix diagonals, starting from 0, should be included.Intra-chromosomal regional
region
,diag
n_valid
,count.sum
,balanced.sum
, etc.Example:
Intra-chromosomal bi-regional
region1
,region2
,diag
n_valid
,count.sum
,balanced.sum
, etc.Example:
Inter-chromosomal bi-regional
region1
,region2
n_valid
,count.sum
,balanced.sum
, etc.Example:
Interpreting
region
columnsIn general, region names are given as mnemonic names. Their actual coordinates are specified in a separate BED-like region table with schema [
chrom
,start
,end
,name
]. As BED files, they generally do not have header lines when saved as text.Example:
In the simple case when using whole chromosomes as regions, the region table may be omitted. Application code should assume this is the case when a region table is not provided.
Amendment (2021-03-02) (amended by Ilya,
diag
->sep
)Intra-chromosomal regional will be dropped in favor of the bi-regional format. Symmetric intra-chromosomal zones will use the same name for
region1
andregion2
.diag
will be renamed tosep
to accommodate potential more exotic definitions of separation.region1
,region2
,sep
n_valid
,count.sum
,balanced.sum
, etc.Example:
The text was updated successfully, but these errors were encountered: