What character encoding is required for Sequence Listing XML files?

Source: FAQ (MPEP-Based)BlueIron Update: 2024-09-30

This page is an FAQ based on guidance from the Manual of Patent Examining Procedure. It is provided as guidance, with links to the ground truth sources. This is information only: it is not legal advice.

The MPEP 2413.02 specifies a particular character encoding requirement for Sequence Listing XML files:

The character encoding must be UTF-8 compliant.

UTF-8 (Unicode Transformation Format – 8-bit) is a widely used character encoding capable of encoding all possible Unicode code points. This requirement ensures that:

  • The XML file can properly represent a wide range of characters, including those from various languages and scientific symbols.
  • There is consistency in how the USPTO systems process and interpret the submitted files.
  • The content remains readable and searchable across different platforms and systems.

When preparing your Sequence Listing XML file, make sure to save it with UTF-8 encoding to comply with this USPTO requirement and avoid potential issues during the submission process.

Topics: MPEP 2400 - Biotechnology MPEP 2413.02 - Form And Format Of The Xml File Containing The "Sequence Listing Xml" Patent Law Patent Procedure
Tags: Character Encoding, sequence listing xml, Utf-8