Skip to content

About XML filter #10

@ShaneTian

Description

@ShaneTian

In the StarCoder, the XML filter removes some files that contain <?xml version= within the first 100 characters.
image

However, this step is not included in the language-specific filters of the the-stack-v2.
image

So, why remove this filter?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions