What are the encoding requirements for a Sequence Listing XML file?

Source: FAQ (MPEP-Based)BlueIron Update: 2024-09-30

This page is an FAQ based on guidance from the Manual of Patent Examining Procedure. It is provided as guidance, with links to the ground truth sources. This is information only: it is not legal advice.

The encoding requirements for a Sequence Listing XML file are specified in MPEP 2413.01(a):

  • The file must be encoded using Unicode UTF-8
  • It must comply with XML 1.0 specifications
  • Character usage is restricted based on the element type

Specifically, the MPEP states: “The file must be encoded using Unicode UTF-8, with the following restrictions: (1) the information contained in the elements ApplicantName, InventorName and InventionTitle of the general information part, and the NonEnglishQualifier_value of the sequence data part, may be composed of any valid Unicode characters indicated in the XML 1.0 specification except the Unicode Control code points 0000-001F and 007F-009F.”

Topics: MPEP 2400 - Biotechnology MPEP 2413.01(A) - The "Sequence Listing Xml" Is A Single File Encoded Using Unicode Utf - 8 Patent Law Patent Procedure
Tags: sequence listing xml, Unicode Utf-8, Xml Encoding