RFC009 Title: DASBED Format Author: Jim Kent Version: 1 Date: 18 August 2001 I'd like to propose a format that is closer to what's native for the UCSC browser. I think it's more flexible, a bit better defined, and definitely denser than DASGFF.   Here's the DTD with comments to the right         Top level data structure          Some database ID.        One or two word label. Defaults to id.  One line description. Defaults to label.         Version must be floating point number    Overall author list.         Top level URL.  See below for details                A tag around a possibly long description.          Encloses annotations around a segment of genome      Sequence id (as in DASGFF's SEGMENT)    Start coordinate (1 based)      End coordinate (inclusive) Goes with sequence id.     One or two word label. Defaults to seq:start-end  One line description.  Defaults to label.                A track of data over a specific segment           Some database ID.        One or two word label.  Defaults to id. One line description. Defaults to label.       Floating point number.        Track author.  Defaults to DASBED URL.          Track level URL. Defaults to DASBED URL.                An item on a track, may contain multiple blocks.        Start of first block relative to segment start.          End of last block relative to segment start.           Some database ID.         One or two word label.  Defaults to id.   One line description.  Defaults to label.       Optional version here mostly for curated items.        Optional author mostly for curated items.          Item level URL.  Defaults to TRACK URL.        +, -, or not even present.         A number between 0 and 1000.    CDS start for genes.      CDS end for genes.           ID of query sequence in an alignment.                           A block.  Can hold alignment info.  Deliberately small.        Start in sequence relative to item start.          End in sequence relative to item start.         Score between 0 and 1000.          Query side of alignment including '-' chars.          Target (in segment) side of alignment.        Query start coordinate.          Query end coordinate.   An example:      This is data from the UCSC genome browser/database          based on the April 2001 assembly on the Wash U map.                                                          I'm dubious about this one. -jsbach@genoscope.fr                                      These Mouse/Human alignments were generated with BLAT using                 a 7 of 8 seed at the amino-acid level.  The mouse sequence                 was generated by the Mouse Sequencing Consortium and                 assembled with Arachne.                                                               The href's can have the following variables embedded in them:    $I - replaced with the item ID.    $T - replaced with the track ID.    $G - replaced with the global bed ID.    $S - replaced with sequence ID    $A - replaced with start of item.    $Z - replaced with end of item. These will be substituted before any linking takes place.   Most of the time it will be possible to specify the URL at the track or even global level.