Skip to content

chuuhtetnaing/myanmar-nrc-format-dataset

Repository files navigation

Myanmar NRC Format Data

This repository contains cleaned and standardized data for Myanmar National Registration Card (NRC) format references.

Dataset Files

  • nrc_dataset.csv / nrc_dataset.json: Cleaned and simplified dataset containing only the standardized fields (original fields were removed):

    • state_code: Numeric code representing the state/region (1-14)
    • township_code_en: Township code in English (e.g., "AHGAYA")
    • township_code_mm: Township code in Myanmar (e.g., "အဂယ")
    • township_en: Township name in English
    • township_mm: Township name in Myanmar
    • state_en: State/region name in English
    • state_mm: State/region name in Myanmar
  • nrc.json: Optimized dataset specifically prepared for direct use in web applications or mobile apps with minimal post-processing required. This file uses the same cleaned data structure but is formatted for easy integration into frontend applications.

  • full_nrc_dataset.csv / full_nrc_dataset.json: Extended dataset containing both original fields from the source data and the cleaned fields, providing a comprehensive reference and allowing comparison between original and cleaned data.

Data Sources and Cleaning Process

  • Original data sourced from KBZ MPU (Myanmar Payment Union) website
  • Data cleaning and verification performed using:
    • Wikipedia references for Myanmar townships
    • Manual verification for township names and codes
    • Correction of spelling inconsistencies and errors
    • Standardization of township and state names

Usage

This dataset can be used for:

  • NRC format validation
  • Township and state code reference
  • Building applications that require Myanmar NRC data

License

This dataset is free to use for any purpose - personal, commercial, or educational.

Contributions

Feel free to create a pull request if you find any errors or have improvements to suggest for the dataset. Contributions to enhance the accuracy and completeness of the data are welcome.