How should sequences with defined regions separated by undisclosed gaps be represented in a Sequence Listing XML?
MPEP 2412.05(e) gives specific guidance on this point:
“A nucleotide and/or amino acid sequence that contains regions of specifically defined residues separated by one or more gaps of an unknown or undisclosed number of residues must be listed in the ‘Sequence Listing XML’ in the manner described in paragraph 37 of WIPO Standard ST.26.”
The MPEP further clarifies that such a sequence must not be represented as a single sequence. Instead, each region of specifically defined residues must be included as a separate sequence and assigned its own sequence identifier. For example, a sequence disclosed as two defined regions with an undisclosed gap between them is listed as two sequences, one per region, so the listing never implies a residue count for a gap whose length is unknown.
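The splitting rule described above can be sketched in a few lines of Python. This is only an illustration, not an official tool or format: the `...` gap marker, the function name `split_into_regions`, and the dictionary keys are all assumptions made for this example, and the output is a plain list rather than actual ST.26 XML.

```python
import re

# Hypothetical input notation: three or more dots mark a gap of unknown
# length. ST.26 does not define this marker; it is an assumption used
# here only to show where the disclosed sequence is split.
GAP_MARKER = re.compile(r"\.{3,}")


def split_into_regions(disclosed, start_id=1):
    """Split a disclosed sequence at its undisclosed gaps.

    Each region of defined residues becomes its own entry with its own
    sequence identifier, instead of one combined sequence.
    """
    regions = [r for r in GAP_MARKER.split(disclosed) if r]
    return [
        {"sequence_id": start_id + i, "residues": region, "length": len(region)}
        for i, region in enumerate(regions)
    ]


entries = split_into_regions("atgcatgcatgcaa...ggctagctagcttt")
for entry in entries:
    print(entry["sequence_id"], entry["residues"], entry["length"])
```

In this sketch the two defined regions end up as two separate entries with identifiers 1 and 2, and the gap itself is not represented in either one.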