Chromosome names. A chromosome name, as defined by this module, consists of two parts. An optional prefix "chr" (case-insensitive), followed by a suffix identifying the chromosome. The possible suffixes (case-insensitive) are:
Functions for converting to an Arabic or Roman format are
provided. The benefit is that these functions convert to a
canonical form, and thus the resulting strings can be compared
reliably. For example, to_arabic "Chr4"
and to_arabic "IV"
both
return "4". The Arabic format is recommended over the Roman form
because the Roman form is incomplete; e.g. it cannot represent chromosome
number 10 because there would be an ambiguity with the maternal
chromosome "chrX".
module Biocaml_chr:
sig
module Error:
sig
typet =
[ `chromosome_ambiguous_in_roman_form of string ]
chr
's Roman
representation is ambiguous, e.g. "chr10" and "chrX" both lead
to "X"end
exception Error of Error.t
val to_arabic : string -> string
to_arabic s
returns the canonical Arabic representation of s
.val to_roman : string -> (string, Error.t) Core.Std.Result.t
to_roman s
returns the canonical Roman representation of s
.val to_roman_exn : string -> string
end