What symbols should be used for representing nucleotides in a sequence listing?

The MPEP 2423.01 specifies the symbols to be used for representing nucleotides in a sequence listing:

“The bases in nucleotide sequences must be represented using the one-letter code for nucleotide sequence characters. Only lower case letters in conformity with the list given in WIPO Standard ST.25 (1998), Appendix 2, Table 1, may be used.”

The approved one-letter codes for nucleotides are:

  • a – for adenine
  • c – for cytosine
  • g – for guanine
  • t – for thymine in DNA or uracil in RNA

It’s important to note that only lowercase letters should be used, as specified by the WIPO Standard ST.25.

To learn more:

Topics: MPEP 2400 - Biotechnology, MPEP 2423.01 - Format And Symbols To Be Used In A "Sequence Listing", Patent Law, Patent Procedure
Tags: Nucleotides, One-Letter Code, sequence listing, Wipo Standard St.25