What are the encoding requirements for a Sequence Listing XML file?
The encoding requirements for a Sequence Listing XML file are specified in MPEP 2413.01(a):
- The file must be encoded using Unicode UTF-8
- It must comply with XML 1.0 specifications
- Character usage is restricted based on the element type
Specifically, the MPEP states: “The file must be encoded using Unicode UTF-8, with the following restrictions: (1) the information contained in the elements ApplicantName, InventorName and InventionTitle of the general information part, and the NonEnglishQualifier_value of the sequence data part, may be composed of any valid Unicode characters indicated in the XML 1.0 specification except the Unicode Control code points 0000-001F and 007F-009F.”
To learn more:
Topics:
MPEP 2400 - Biotechnology,
MPEP 2413.01(A) - The "Sequence Listing Xml" Is A Single File Encoded Using Unicode Utf - 8,
Patent Law,
Patent Procedure