DAVID Documentation: Functional Annotation Clustering

Functional Annotation Clustering groups similar annotations together using a novel fuzzy clustering algorithm. The algorithm is based on the hypothesis that similar annotations should have similar gene members. The Functional Annotation Clustering integrates the same techniques of Kappa statistics to measure the degree of the common genes between two annotations.

Functional Annotation Clustering Interface 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22

  1. Gene List
    Current gene list selected for view and background species being analyzed.
  2. Help and Switch to Classic Version
    Link to this help page and provides access to the legacy Gene List Report interface.
  3. Show/Hide Options
    Click this button to expand or hide additional filtering options.
  4. Classification Stringency
    Click to access a drop-down menu of stringency levels, then click desired level to select. A high-level single control to establish a set of detailed parameters involved in functional classification algorithms. In general, the higher stringency setting generates less functional groups with more tightly associated genes in each group, so that more genes will be unclustered.
  5. EASE Threshold
    Minimum P-value of results to be displayed. Click to either type a numberical value or use the up/down arrows to increment by 0.01.
  6. Update Table
    After making adjustments to filtering options, click this button to update the table using the selected parameters.
  7. Similarity Term Overlap
    Click to access a drop-down menu of overlap values. The minimum number of annotation terms overlapped between two genes in order to be qualified for kappa calculation. This parameter is to maintain necessary statistical power to make the kappa value more meaningful. The higher the value, the more meaningful the result is.
  8. Similarity Threshold
    Click to access a drop-down menu of theshrold values. The minimum kappa value to be considered significant. A higher setting will lead to more genes going unclustered, which leads to a higher quality functional classification result with fewer groups and fewer gene members. Kappa value of 0.3 starts giving meaningful biology based on our genome-wide distribution study. Anything below 0.3 has a good chance to be noise.
  9. Initial Group Membership
    Click to access a drop-down menu of membership values. The minimum gene number in a seeding group, which affects the minimum size of each functional group in the final cluster. In general, the lower value attempts to include more genes in functional groups, and may generate a lot of small size groups.
  10. Final Group Membership
    Click to access a drop-down menu of membership values. The minimum gene number in one final group after a 'cleanup' procedure. In general, the lower value attempts to include more genes in functional groups and may generate a lot of small size groups. It cofunctions with previous parameters to control the minimum size of functional groups. If you are interested in functional groups containing only 2 or 3 genes, you need to set it to a very low value. Otherwise, the small group will not be displayed and the genes will go unclustered.
  11. Multiple Linkage Threshold
    Click to access a drop-down menu of threshold values. This parameter controls how seeding groups merge with each other, i.e. two groups sharing the same gene members over the percentage will become one group. A higher percentage, in general, gives sharper separation (i.e. it generates more final functional groups with more tightly associated genes in each group). In addition, changing the parameter does not cause additional genes to go unclustered.
  12. The search box filters all table results in real time.
  13. Create sublist, Export options & Column Visibility
    Create a sublist by clicking on checkboxes to the left of desired results, then clicking the Create sublist button, where you will be prompted to name the new sublist. Sublists will then appear in the list manager as a new gene list.
    Results can be exported using Copy, Excel, CSV, or PDF. Exports reflect the current filtered and sorted view.
    Additional columns can be added to the current view using the Column Visibility drop-down button. Click on available items to add the column to the table.
  14. Click on any table column header to sort the table by that column in ascending or descending order.
  15. Cluster
    A group of terms having similar biological meaning due to sharing similar gene members.
  16. Enrichment Score
    The overall enrichment score for the group based on the EASE scores of each term member. Higher value = more enriched.
  17. Gene Report
    Click on the red G to be directed to the Gene Report page for that genes in this cluster.
  18. Cluster view
    Click this icon to be directed to a 2D cluster view of the genes in this cluster.
  19. Annotations
    Annotations available for each item. If available, hyperlinks lead users to original resources for further detials.
  20. Related Terms
    Click RT to be directed to a Functionally Related Terms search result for each term. For more information, see the Related Terms documentation page.
  21. Gene Count and Percentage
    Count and percentage of genes involved in each term.
  22. P-value & Benjamini
    EASE Score where smaller means more enriched. Benjamini in DAVID requests adjusted p-values by using the linear step-up method of Benjamini and Hochberg