What are the encoding requirements for a Sequence Listing XML file?

The encoding requirements for a Sequence Listing XML file are specified in MPEP 2413.01(a):

  • The file must be encoded using Unicode UTF-8
  • It must comply with XML 1.0 specifications
  • Character usage is restricted based on the element type

Specifically, the MPEP states: “The file must be encoded using Unicode UTF-8, with the following restrictions: (1) the information contained in the elements ApplicantName, InventorName and InventionTitle of the general information part, and the NonEnglishQualifier_value of the sequence data part, may be composed of any valid Unicode characters indicated in the XML 1.0 specification except the Unicode Control code points 0000-001F and 007F-009F.”

To learn more:

Topics: MPEP 2400 - Biotechnology, MPEP 2413.01(A) - The "Sequence Listing Xml" Is A Single File Encoded Using Unicode Utf - 8, Patent Law, Patent Procedure
Tags: sequence listing xml, Unicode Utf-8, Xml Encoding