AGCoL SyMAP Queries UA
SyMAP Home | Download | Docs | Input | System Guide | User Guide | Tour
Content:
         > Instructions        
> Query Setup        
General
Single genes
Pair hits
Groups of hits
Result Panel  
Result table    
Columns    
   Pair, Single, Auto-save    
Statistics
Top buttons
   View 2D, Gene Report...
3-chromosomes
> Results

See Release for the latest v5.6.1 and v5.6.2 changes.

See Terminology in the User Guide, especially the Cluster Hit Algo1 versus Algo2 description.


> Instructions

To open the query interface, first select two or more sequence projects in the Project Manager. Then select the Queries button at the lower right to open the queries interface.
The Instructions window (on right) lists the projects which were selected for querying.

The Notes states what the Olap column represents; if one or more project pairs used Algo1, it will be gene overlap; if all project pairs used Algo2, it will be exon overlap.

Open the Query Setup window by clicking on its tab in the left panel.

query_oview

> Query Setup

Go to top
      Sections:    1. Rules    2. General    3. Single genes    4. Pair Hits    5. Groups of hits

1. Rules

Go to top
Set up the desired filters and then select Run Query to execute it. When the search is complete, the table result panel will be displayed.

1. Most filters can be used in conjunction with other filters; options will be disabled if they cannot be used with a selected filter.

2. The Single queries returns rows without a hit, and only list genes for one project.

3. All Pair hits queries return a row per hit pair, i.e. aligned region between to two chromosomes from two projects.

query_query
4. All Hit# are uniquely numbered for a chromosome pair, e.g. there will be a Hit# 3 on all chromosome pairs that have hits. query_table_rows
5. All Gene# are sequentially numbered per chromosome, e.g. there will be a Gene# 3 on all chromosomes that have at least 3 non-overlapping genes. Overlapping genes have the same number, but different suffixes, e.g. 4.b.

6. Major and Minor genes:

  1. A hit can align to multiple genes on either or both chromosomes; it is assigned to the best (major) gene on each end.
  2. For example, in the image on the right, the pink gene is the major gene as the hit (pink line) fits it best, whereas the burgundy gene is the minor gene.
  3. When queries involve genes, only the hits to the major genes will be shown except for the following queries, which will also list the minor genes:
          Every*, Hit#, Gene#, and Multi with Minor* checked.
Hit overlaps two genes

Run/Stop Query:

Wait for the current query to stop running before starting another!
It is not possible to start a new query from the same panel, but you could start one from a different table; they both will fail, so do NOT do this!

Stop
When a query is running, you will see a status line and the Stop button. You can stop the query which will take the right hand view to the Results Panel.

It cannot stop the database search, but once the search is done, it stops all subsequent processing. You will typically see the status line change 3 or 4 times before showing the table.

Query Stop
Query Stop

2. General

Go to top
Annotation Description
Enter a substring: the entire annotation string (i.e. column All_Anno) will be searched for the substring. Hits will be shown that have at least one gene, of two possible, with the annotation.
General

Location

Chr  Select a specific chromosome for the species.
From  The Hit Start coordinates for the selected chromosome will be >= this number.
To  The Hit End coordinates for the selected chromosome will be <= this number.

It is valid to enter only the From or To, or leave both blank.

The From and To are disabled for Single genes, Block#, Collinear set#, Hit#, and Gene#.

3. Single genes

Go to top
The Single queries produces rows of genes; there is no hit or block information since the rows do not represent hits. Single
Options:
  • Orphan genes (no hits)
    Genes that do not a have a hit and meet the additional filters. The orphan genes are relative to the projects shown on the Instruction page. For example, if species X, Y and Z have synteny computed between all pairs, but only X and Y are selected, the orphan genes for X would be those with no hits to Y. If X,Y&Z are selected, the orphan genes for X would be those with no hits to Y and Z.
     
  • All genes (w/o hits), i.e. genes with and without hits
    This shows all genes that meet the additional filters, regardless if they have a hit or not. There is always the same set of genes for a project, regardless of synteny.
Unselect species: When the Single or Gene# is selected, the check boxes beside the species names will be activated. In order to view the genes from just one species, deactivate the others. Additionally, a single chromosome can be selected, but location cannot be entered. Single

4. Pair hits

Go to top
Each hit connects two species (projects) and hence represents a pair of aligned regions for two of the selected species.

Filters are as follows:

Pair

In Block (Synteny Hit)

Yes Only hits that are part of a synteny block will be returned. All hits will have a value for the Block column.
No Only hits that are NOT part of a synteny block will be returned. No hits will have a value for the Block column.

Annotated (Gene Hit)

Every  Only hits that align to a gene on one or both sides of the hit will be shown. The Gene# columns will list the best gene.
Every*  This is like the Every option, but a hit may be listed multiple times if it aligns to minor genes. The minor Gene# will be suffixed with an "*" and the Hit Type is unknown.
One  Only hits that align to a gene on ONE end will be shown. The Gene# column will show the best gene for the end with the hit.
Both  Only hits that align to genes on BOTH ends will be shown. The Gene# column will show the best gene for both ends of the hit.
None  Only hits that do NOT align to a gene on either end will be shown. The Gene# column will not have a value for either end of the hit.
If a species chromosome is selected with One and a species chromosome selected, the hit must be on the species chromosome, but the "one" gene can be on either end. Same for Every or Every*.

Collinear size

>= [=, >=] N   List all hits in collinear sets that have size >= N or = N or <= N, respectively.
Ignore   Do not filter on collinear set sizes.
See Collinear, which explains the SyMAP collinear sets.
Go to top

For the following 4 filters, do not include the 'Chr' number. Instead use the chromosome pull-downs to narrow the search to a specific chromosome, as exampled in Block#.

Block#
Enter a single block number (the Block column is formatted Chr.Chr.Block#).
This will display all hits with this block number from any chromosome pair.
Example using the chromosome pull-downs: if you select Chr 1 from the first project, Chr 2 from a second project, and enter block=3, you will see hits in block 1.2.3.

Collinear Set#
Enter a set number (the Collinear column is formatted Chr.Chr.Size.Set#).
This will display all hits with this set number from any chromosome pair. See Block example.

Hit#
Enter a hit number. Both major and minor gene hits will be shown.
This will display all hits with this number from any chromosome pair. See Block example.

Gene#
Enter a Gene# number (the Gene# column is formatted Chr.Gene#.suffix, where suffix may be blank).
A gene will only show if it has a hit. All hits with the Gene# on either end will be shown.
If a gene has a suffix:
 • If only a number is entered, all genes with the numeric prefix will be displayed (including minor hits).
 • If a number.suffix is entered, the exact gene will be displayed, including minor hits.
Select a chromosome:
 • The hit has to have one end to the chromosome, but the Gene# can be on either end.
 • By unselecting the other species (see Unselect species), the Gene# will only be on the specified end.
Group:
 • When a gene hits multiple places on the same opposite chromosome, the hits are put in a group
   (see Group column), which can be viewed with the 2D Group pull-down option (similar to Multi-hit).

4. Groups of hits

Go to top
   Sections:     1. Multi-hit genes    2. Cluster genes   

These two options are computed on the fly. They produce query results with values for the Group column; this in turn allows the View 2D Group option to be used.

4.1. Multi-hit genes

Multi

List all genes that have >= N hits to the same species. To view the groups merged, see Multi-hit gene report.

  • The target gene refers to the gene with >= N hits to the opposite species.
  • The target must be a gene, but it can be to non-gene hits on the opposite species.
  • Used with the following filters:
    • Both (Annotated Genes): if selected, the opposite species must also have a gene hit.
    • Chr: If one or more chromsomes are selected, the target genes will only be on those chromosomes.
  • See Multi-hit Gene Report... for a good way to see a condensed view of the results.

The options are as follows:

 Minor*  Include minor hits on either chromosome.
  Tandem  (Annotated species only): The >=N hits must be to a tandem array of genes.
Same Chr is automatic with this option.
  Same Chr  The >=N hits must all be on the same opposite chromosome.
  Diff Chr  The >=N hits may be on any set of opposite chromosomes for a given species.
Go to top
The table can list the same hit multiple times, because gene X and gene Y may be connected by a hit, and both genes have >=N hits, so the hit in both groups need to be shown.

An example is shown on the right, where Hit#881 is in both Atha Group#3 and Brap Group#15. The image below shows the two groups, where the group hits are highlighted in magenta; these were produced using the View 2D option Group (selecting any hit in the group results in the same 2D display).

multi-hit results

multi-hit 2D

4.2. Cluster genes

Go to top
Note: this algorithm was written for v5.6.3.
  • This algorithm finds clusters of overlapping genes.
  • Each cluster is given a group number, which is shown in the Query Results table (column name Group).
  • These can be pre-filtered by species chromosome, etc.
  • Note, if you have more than 6 species selected, this stage can take an hour or more.
There image on the right shows the filters.

The first fitler, >= N hits, is always applied.
Then select a second filter from the following:

  Cluster3
>=N List all clusters that have >= N hits.
Include The resulting clusters must have at least one gene from all included species.
Exclude The resulting clusters will contain no genes from the excluded species.

PgeneF: If you start viewSymap with the "-g" flag, you will see the filters from the original PgeneF cluster algorithm. →Note: the PgeneF feature has not been tested for a long time except superficially (i.e. making sure it has the same results as the previous release). It remains available until the Cluster gene algorithm is enhanced to include non-gene hits.

If you start viewSymap with the "-g" flag, you will see the filters as shown on the right. The one addition to v5.6.3 is the >= N hits filter. Cluster3
No gene only shows clusters without annotation. The second two filters only occur if there is >2 species. From the original documentation: Linkage: Require the PgeneF to be fully linked, i.e. for each pair of species A and B in the group, there must be a hit linking A to B. At least one: Only PgeneF hits will be shown if they have hits to the included species, although the PgeneF numbers will reflect groupings created using all hits.

Result Panel

Go to top
   Sections:     1. Results table    2. Columns    3. Statistics    4. Top buttons    5. 3-chromosomes   

1. Results Table

A pair hits table
query results
  • Pair Hits:
    • The table contains columns for all of the selected species, but each hit only connects two species, and the other species columns are empty.
    • Each Hit# is only listed once unless minor genes are included (see Rules).
    • A gene may be listed more than once if it has multiple major hits.

  • Single genes:
    • If the query specified Single genes, then each row represents one gene and shows data only for one species.

You can sort the columns by clicking the column name in the table, and rearrange them by dragging the column name. You can add/remove columns using the Select Columns button at the bottom.

2. Columns

Go to top
   Sections:    1. Pair hits columns    2. Single gene columns    3. Auto-save columns

query columns

The buttons on the bottom will be Select Columns and Hide Stats. If Select Columns is selected, it changes to Hide Columns and the Hide Stats is replaced with the 3 buttons explained below. query results
ClearClears the selection of all columns except Row#.
DefaultsSelects the default columns, which are shown in the image above. If the collinear query was run, this will include the Collinear. If the group query was run, this will include the Group.
ArrangeArranges similar columns, putting the gene columns first.

In the column panel shown above, hover over a column name to see its brief description. Following are the full descriptions of the columns.

2.1 Pair hits columns

Go to top
General
Row  Row number. This column does not sort.
Block  The synteny block containing this hit (if any). The format is Chr.Chr.Block#, where the two "Chr" are chromosome numbers.
Block Hits  The number of hits which comprise the synteny block.
Collinear  The collinear set containing this hit (if any). The format is Chr.Chr.Size.Set# (e.g. 1.2.5.100; there are 5 adjacent gene hits in set# 100 on Chr1 to Chr2).
Group  Gene#, Multi-hit gene, Cluster: These three queries produces groups of hits, where each group has a group number and size. The format is Size.Group#; clicking the column heading sorts by Group#. These numbers are generated during the search so will not be the same for different filter settings.
Hit#  The number assigned to the hit. They are sequential along the chromosome of the alphabetically lesser species, e.g. Arab<Brap.
Hit %Id  Percent identity of the alignment. The value of the "Identity" column is from the MUMmer file. If the hit has subhits, then this is an approximation.
Hit %Sim  Percent similarity of the alignment (as determined by the BLOSUM scoring matrix). The value of the "Similarity" column is from the MUMmer file. If the hit has subhits, then this is an approximation.
Hit #Subs  The number of subhits in a clustered hit.
Hit St  The strand for the two ends of the end separated by a "/".
Hit Cov  The summed subhits within a clustered hit taking into accounts overlaps. The summed subhits are usually different for the two sides; this will be the longest.
Hit Type  There are two alternative algorithms for clustering the hits on database creation, which assign different hit types, as follows:
Algo1: g2 (two genes), g1 (one gene), g0 (no genes).
Algo2: E is exon, I is Intron, n is intergenic. There will be 2 characters, one for each gene, where the 1st letter goes with the alphabetically lesser project name;
e.g 'EI' would indicate the hit covers A.thal exon and Cabbage intron.
Gene&Hit Info: one row for each species
Chr  Chromosome of the hit.
Gstart/Gend/Gst  Start and end of the annotated gene. The Gst is the strand (+/-).
Gene#  The gene number is C.#.{a-z}. The C is the chromosome number. The # is the sequential number along the chromosome. If a run of genes overlap, they receive the same gene number with different suffixes {a-z, a2-z2, etc}.
Hstart/Hend  Start and end of the hit region.
Hlen  Hend-Hstart+1
Olap  The value depends on which Cluster Hit algorithm was used.
Algo1: If any of the project pairs used Algo1, then this column will be the gene overlap.
Algo2: If all of the project pairs used Algo2, then this column will be the exon overlap.
Annotation: one row for each species
The keywords for the annotations of each species are listed; they can be different for each species. See GFF Attributes for modifying the keywords shown. The Anno Key Count can be modified at any time using symap (not viewSymap).

2.2. Single genes columns

Go to top
The single genes table only has the Gene Info and Annotation columns, with one additional column, as follows.
NumHits This is the number of hits to the gene in the ENTIRE databases, except for SELF synteny.
For example,
→ if Arab, Brap and Cabb species have all been compared for synteny,
→ and only Arab-Brap or Arab-Cabb are being queried, they will have some rows with NumHits>1,
→ and the Arab-Brap-Cabb query will have all NumHits=0.
This is illustrated below, where gene# 1.2 is the first table because it does NOT have a hit to Cabb, but it has NumHits=1 because it DOES have a hit to Brap.

Arab-Cabb orphans
query single results2
  Arab-Brap orphans
query single results
  Arab-Brap-Cabb orphans
query single results

2.3 Columns and order shown

Go to top
During a SyMAP session, when you display a new table, it will use the columns and order from the last table created or modified (add/remove columns).

The selected columns are saved between sessions (described below), but the order is not.

2.4 Auto-save columns

The columns selection is saved in a file called .symap_saved_props in the user's home directory so that the next time you viewSymap, the table will show the same column (but in their default order).

If you have multiple SyMAP databases, when you change between them the columns displayed are relative to the last SyMAP database queried (they may seem some what random to a different SyMAP database).

3. Statistics

Go to top
Statistics for the query results are shown at the bottom of the results table. They can be hidden by selecting Hide Stats.

result stats

Most of the statistics are self-explanatory except the following:

Annotated and Genes: The first is the number of hits that overlap one or more genes, where a gene can have multiple hits. The second is the number of Genes with at least one hit.

Groups: This statistic is only shown if the Group column is populated.
For Multi-hit Genes: there is a column with the number of groups with each of the other species. If there is over 2 species, this will also include the highest group number in parenthesis. For example,

  A.thal  Groups: #50 (50)    #142 (192)
  B.rapa  Groups: #618 (810)  #984 (1794)
  Cabbage Groups: #847 (2641) #495 (3136)
   There are 3,136 total groups.
A.thal has 50 with B.rapa and 142 with Cabbage, where the first set is 1-50, the second is 51-192, etc.

4. Top buttons

Go to top
   Sections    1. Show    2. MSA...    3. View 2D    4. Export...    5. Gene Report...    6. Search...

query buttons

The Unselect All unselects any selected rows.

4.1 Show

Go to top
For the selected row, a popup will show all columns and associated information for the hit. The text in the popup can be copied.

4.2 MSA...

Go to top
Below is an image of a multiple alignment. The right half shows the sequence, where the first row is the consensus: A 'n' (purple) is used where there is no majority base. Lowercase (green) is used if there is a least one mismatch. The scroll bar is not shown, as it is on the bottom of the panel. query MSA

Select one or more rows from the query table. The sequences of the selected hit(s) are written out and a multiple alignment is created using either MAFFT (Katoh 2013 MBA:30) or MUSCLE (Edgar 2004 NAR:32). MAFFT can be run with multiple CPUs and the optional --auto option, where --auto takes longer but allows MAFFT to determine the best algorithm to use.

Merge overlapping hitsInstead of repeating the alignment of identical regions, overlapping hits will be merged. The above shows that Atha.01 has 5 merged hits.
Use gapless hitsMultiple subhits can be clustered into one hit, where there are gaps between them. The subhits are concatenated together with 5 N's in between each non-overlapping subhit. This is important for long genes as found in humans, where the are often to exons with large gaps covering the introns.
None of the aboveAlign from the start coordinate to the end coordinate of each end of each selected hit.
TrimThere can be a long set of bases at the beginning or end of one sequence, these will be removed.
query align menu
MAFFT is considerably faster than MUSCLE on long sequences; however, MAFFT can take a long time if aligning over 300,000 bases, which is why the Gapless option exists.
query MSA banner While aligning, the line at the top of the panel will state the number of bases aligned as shown on the right.

The Stop does not stop MAFFT or MUSCLE, that must be done manually.

General Notes:
Wait for the current MSA to stop running before starting another!
It is not possible to start a new MSA from the same table, but you could start one from a different table; they both will fail, so do NOT do this!

Stop: This removes the tab from the left panel, but it does not stop the execution of MAFFT or MUSCLE.

  • Their processes will be called muscle for MUSCLE and distbfast for MAFFT; MAFFT may have some other process running, but distbfast this seems to be the one that takes the most time/memory.
  • To stop: you can type top at the command line, copy the 'process number' beside distbfast or muscle, then at the terminal, type kill -p 'process number'.
  • Make sure distbfast or muscle are stopped, as they take a lot of memory!
    However, if you do not see it, then it has completed.

MAFFT Notes:
→ All executables were removed that did not seem necessary. However, I may have removed one(s) that are used in certain unusual situations when --auto is used. Try again with --auto unchecked.
→ On Mac, if you are using MAFFT --auto, you may get a "Cannot be verified" for dndpre. See
MacOS and external programs to fix this.

4.3 View 2D

Go to top
query 2D

This displays the 2D view for the selected entry (see 3-chromosomes). The region displayed can be specified by the drop-down beside the View 2D button, as follows:

OptionColumn*Selected HitDisplay Filter
Region N/A The hit is padded to each side by the amount indicated in the kb text box. Show all hits.
Collinear Collinear The entire collinear set of hits for the selected hit will be shown. Show Block and CoSets, whereCoSets is collinear sets.
Block Block The entire synteny block for the selected hit will be shown. Show Block Hits.
Group Group The entire group of hits from the two chromosomes of the selected hit will be shown. Show all hits
*The selected row must have a value for the column.

High checkbox If selected, the selected hit is highlighted in the Popup-query color (default magenta). The coloring can also be turned off by selecting the 2D Hit Filter Hit popop (or Query) option.
Gene# checkbox If selected, the Gene# will be shown beside each gene in the 2D display, else the Annotation box will be shown (see 2D image on lower right).
Example: Go to top

The image on the right shows a Collinear set of 5. After the initial display, the 2D view can be changed as described in the User Guide.

 

The table below shows results when the Group column has a value (using Gene#, Multi-gene or Cluster search). The Group option can be used with these rows.

query 2D columns

query synteny

4.4 Export...

Go to top
One or more rows can be selected for the following exports; or if no rows are selected, the entire table is exported.

CSV: Export the rows using the selected set of columns to a CSV format suitable for import into Excel.

HTML: Export the rows using the selected set of columns to a HTML format suitable for viewing on the web (e.g. Example).

FASTA: (Pairs Only) For each row, the two hit sequences from the Hstart to Hend are written to file. NOTE: this is a very slow function and takes minutes if many rows are selected.

The Include Row column option is available because this column is alway present, but it may be desirable to not include it in the output.

result export

4.5 Gene Report...

Go to top
   Contents    Interface    Gene report    Collinear report    Multi-hit gene report    Cluster gene report   
  • The report is on the genes in the query table; hits without paired genes are ignored.
  • Unless the query is for Collinear, Multi-hit or Cluster genes, a Gene report will be produced.
  • This is most relevant when used with >2 species. It has not been tested for >4 species.
  • Gene, Collinear and Multi-hit genes produce a reference-based report whereas the Cluster genes does not; however Cluster genes shares most of the same options.

4.5.1 Reference-based Interface

The menu on the lower-right shows the options that will always exist. There may be additional options in between the middle lines, depending on the condition used to create the table, as discussed below.

Reference: In all cases, a reference must be selected; all reference genes are listed in a column, and any genes it links to (has a hit) are referenced in the other species columns.

Gene Annotation Columns: One or more annotation keywords can be entered in a comma-delimited list (e.g. product, ID); the keywords must be found in the All_Anno keyword column. A column will be created containing the values of all entered keywords. Columns can be entered for any of the species (not just the reference).

report menu gene
Gene Annotation Columns
WidthDetermines the length of the annotation before wrapping around; relevant to long descriptions.
Truncate Truncate long description using the width specified.
Gene#Show the Gene# before the gene annotation text.
Show
BorderDraw a border around each cell of the table. Otherwise, there will only be a line before each group of genes.
Create
PopupDisplays a panel of the results, which will look just like viewing the HTML file.
HTML File Writes a file that can be viewed as a web page. It is written in a human readable form such that anyone with HTML knowledge can edit it.
TSV File Writes a tab-separated-values file that can be viewed with Excel or any editor.

Additional options for >2 species: The following options will be shown between the middle lines if there are >2 species. The options are explained with the respective report.

Link: This term is used to indicate that there is a hit between two species, typically non-reference species.

report menu options

For all reports:

  • The reference column is in italics. In the TSV file, the reference header has an '*'.
  • The SyMAP Gene# is used in the report, which provides the chromosome number and order of genes.
  • The annotations correspond to the genes listed for the species. A "---" indicates that there is a gene with no corresponding annotation. If there is more than one gene in a column, the corresponding annotations will be separated by a ";".
The following discussion will all apply to results from 3 or 4 species.

4.5.2 Gene Report

Go to top
Gene - additional options for >2 species
All speciesPer RowOnly show rows with all species.
 Link+Only show rows with all species and at least one link.
 All Link Only show rows with links between all species (this often produces no results).
ShowLinks Show the links. Since All Link is completely linked, this can be unchecked.

The report lists all genes from the reference species and what genes each aligns to. The image below shows the top of two different All Link reports, one with Hsa17 selected as the reference and the other is with Mus11 selected.

Rows are always merged that share the same reference. The top line shows there are 15 single rows (one gene for each species) and 0 multi rows (multiple non-reference genes for at least one species). There are 2 multi rows in Mus11, where one is shown in the 3rd row.

A "=" indicates that the gene is linked to the gene next to it (ignoring the Reference column); a "+" indicates it is linked to the gene over one column (only happens when there are 4 species).

report A.thal ref

4.5.3 Collinear Report

Go to top
Collinear sets are grouped to show the union of overlapping sets for the reference gene.

Collinear - additional options for >2 species
All speciesPer UnionThe reference gene must be in a collinear set with every species.
ShowCollinearDisplays the collinear set beside each gene.

The report on the right shows the first union of the A.thal collinear sets.

This was generated with Per Union checked.

The '1 [3 sets]' indicates it is the first union with 3 collinear sets.

For the non-reference gene columns, by default, the collinear set size and number (size.setN) is shown.

For example, row 4:

Gene#Species ChrCollinear set
59A.thal Chr016.6, 8.104, 8.1
5153.aB.rapa Chr096.6
28B.rapa Chr108.104
1Cabbage Chr058.1

The other genes in each collinear set are obvious since they share the same size.setN, e.g. #8.1 is shown in 8 rows.

report A.thal ref

4.5.4 Multi-hit gene report

Go to top
Shows reference genes that align to at least N genes, where N is the number input on the query panel.

Multi-hit gene - additional options for >2 species
All speciesPer RowOnly show rows with all species.
 IgnoreShow all rows.
ShowGroupShow the group (size-number) above the respective.

The Group (size-number) is shown above each group. This used the Gene# option so it prefixes the annotations.

report A.thal ref

4.5.5 Cluster hit report

Go to top
The cluster report has no reference, as shown on the right (i.e. no radio buttons for selection).

Cluster gene - additional options (for any number of species)
DisplayLinked rows Each row will be linked genes. Hence, a gene can be listed in multiple rows if it has multiple unique linkage patterns.
 Unlinked unique Only the unique genes are listed along with how many links they have. There is no indication of what genes are linked (but this is faster then the above).
ShowGroupShow the group (size-group number).
report cluster options
In the two images below, the Unlinked unique is shown on the left and the Linked rows is shown on the right. Each cluster will have all species since the Cluster gene query included them all (i.e Inc(Hsa,Mus,Pan)), but not all genes will be linked; the Linked rows shows the genes that are linked.

Unlinked unique: Cluster #1 (7-1) shows that there is only one Hsa gene, which has 4 links; likewise for the one Mus gene; however, there are 5 Pan genes. If there are only two species shown, then this is an easier way to view the results.

Linked rows: Every gene listed in a row is linked. The first row of 7-1 is the only one that is linked across all species. There are 229 fully linked rows in the 177 clusters; each one has the number of fully linked rows in parenthesis next the cluster label. This is the most informative when >2 species are included.

report cluster

4.6 Search...

Go to top
The Search... produces a popup as shown on the right.

The columns in the popup will vary depending on the columns displayed in the table. This will only show the columns that are searchable. Besides the top 4 shown, the species Gene# and all of the Gene Annotation columns are searchable.

Search results in the table positioning to the row with the first occurrence of the search string, and the row will be selected.

Last#This searches on the last number of a "." or "-" delineated string, i.e. Block, Collinear, Group and Gene#.
For a Gene# with a character suffix (e.g. 1.1.a for chr01, gene# 1.a), enter the 1.a.
ExactThis searches for the exact string.
SubstringThis searches for a substring, which is most relevant for the Gene Annotation of 'desc' or 'product'.
Query search

5. 3-chromosomes

Go to top
Report is only way to view more than two species in a query row (described immediately above).
→As shown here, it is possible to bring up the 2D display with 3 chromosomes from the query table (released v5.6.2).

Select two rows with the following requirements: (1) Both genes must exist in each row. (2) There must be a shared gene.
Select 2 rows

The two chromosomes with the unique genes may be from the same species or different species.
Select 2 rows

Select View 2D with either the Region and Collinear options. The image on the right used the Collinear options.

 

2D Highlight Conserved Genes shows how to highlight shared or unique genes in the 2D display of 3 chromosomes.

report A.thal ref

> Results

Go to top
query results tab

All query results are listed under the > Results tab on the left. Clicking this tab shows the table of queries illustrated above. Query results can be displayed by clicking a result on the left panel, or double-clicking it in the list of results table shown above.

The only way to remove query results from the left tab is by selecting them in this table followed by Remove Selected, all remove all with Remove All.

Go to top
Email Comments To: symap@agcol.arizona.edu