How should coding sequences (CDS) be represented in a Sequence Listing XML?

Coding sequences (CDS) in a Sequence Listing XML should be represented using the “CDS” feature key. According to MPEP 2413.01(g):

“The ‘CDS’ feature key may be used to identify coding sequences, i.e., sequences of nucleotides which correspond to the sequence of amino acids in a protein and the stop codon. The location of the ‘CDS’ feature in the mandatory element INSDFeature_location must include the stop codon.”

Additionally:

  • The “transl_table” and “translation” qualifiers may be used with the “CDS” feature key.
  • The “transl_except” qualifier must be used to identify codons that encode pyrrolysine or selenocysteine.
  • The amino acid sequence encoded by the CDS must be included in the sequence listing with its own sequence identifier.

Proper representation of CDS is crucial for accurate interpretation of the genetic information in patent applications.

To learn more:

Topics: MPEP 2400 - Biotechnology, MPEP 2413.01(G) - The "Sequence Listing Xml" Must Contain A Sequence Data Part, Patent Law, Patent Procedure
Tags: Cds, Coding Sequences, sequence listing xml, wipo standard st.26