What character encoding is required for Sequence Listing XML files?

What character encoding is required for Sequence Listing XML files?

The MPEP 2413.02 specifies a particular character encoding requirement for Sequence Listing XML files:

The character encoding must be UTF-8 compliant.

UTF-8 (Unicode Transformation Format – 8-bit) is a widely used character encoding capable of encoding all possible Unicode code points. This requirement ensures that:

  • The XML file can properly represent a wide range of characters, including those from various languages and scientific symbols.
  • There is consistency in how the USPTO systems process and interpret the submitted files.
  • The content remains readable and searchable across different platforms and systems.

When preparing your Sequence Listing XML file, make sure to save it with UTF-8 encoding to comply with this USPTO requirement and avoid potential issues during the submission process.

To learn more:

Topics: MPEP 2400 - Biotechnology, MPEP 2413.02 - Form And Format Of The Xml File Containing The "Sequence Listing Xml", Patent Law, Patent Procedure
Tags: Character Encoding, sequence listing xml, Utf-8