
Phone Numbers Dataset

This dataset contains synthetic phone numbers generated with the Faker library for 67 locales.

Dataset Description

The dataset includes phone numbers for the following locales:

  • az_AZ
  • cs_CZ
  • de_DE
  • en_AU
  • en_NZ
  • es_CL
  • fa_IR
  • fr_FR
  • hu_HU
  • it_IT
  • lt_LT
  • nl_NL
  • pt_PT
  • sl_SI
  • tl_PH
  • uz_UZ
  • ar_AE
  • bg_BG
  • da_DK
  • de_LI
  • en_CA
  • en_PH
  • es_CO
  • fi_FI
  • he_IL
  • hy_AM
  • ja_JP
  • lv_LV
  • no_NO
  • ro_RO
  • sv_SE
  • tr_TR
  • vi_VN
  • ar_JO
  • bn_BD
  • de_AT
  • de_LU
  • en_GB
  • en_US
  • es_ES
  • fil_PH
  • hi_IN
  • id_ID
  • ka_GE
  • ne_NP
  • pl_PL
  • ru_RU
  • ta_IN
  • tw_GH
  • zh_CN
  • ar_PS
  • bs_BA
  • de_CH
  • el_GR
  • en_IN
  • es_AR
  • es_MX
  • fr_CH
  • hr_HR
  • it_CH
  • ko_KR
  • nl_BE
  • pt_BR
  • sk_SK
  • th_TH
  • uk_UA
  • zh_TW

Dataset Structure

  • phone_number: Synthetically generated phone number
  • country: Country code extracted from the locale

Usage

from datasets import load_dataset

# Download the dataset from the Hugging Face Hub (cached locally after the first call)
dataset = load_dataset("jablonkagroup/phone-numbers")
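Once loaded, a common first step is to check how many numbers each country contributes. The sketch below works on any iterable of rows that expose the two documented fields as dictionary keys. The sample rows are made-up placeholders so the snippet runs without a download, and the function name `count_by_country` is hypothetical:

```python
from collections import Counter
from typing import Iterable, Mapping


def count_by_country(rows: Iterable[Mapping[str, str]]) -> Counter:
    """Count how many phone numbers appear for each country value."""
    return Counter(row["country"] for row in rows)


# Placeholder rows for illustration only; these values are not taken from the dataset.
sample_rows = [
    {"phone_number": "+49 000 0000000", "country": "DE"},
    {"phone_number": "+49 000 0000001", "country": "DE"},
    {"phone_number": "+1 000 000 0000", "country": "US"},
]

counts = count_by_country(sample_rows)
for country, total in counts.most_common():
    print(country, total)
```

With the real data, the same function could be called on one split of the loaded object, for example `count_by_country(dataset["train"])`, assuming the split is named `train`, which this README does not state.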

Generation Details

  • Maximum iterations per locale: 10,000,000
  • Maximum unique numbers per locale: 1,000,000
  • Total locales: 67
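The two limits above suggest a loop that keeps sampling until it has enough unique numbers or runs out of attempts. The following is a sketch of that idea under stated assumptions, not the script actually used to build the dataset. The names `collect_unique` and `faker_source` are hypothetical, and only `Faker(locale)` and `phone_number()` are real Faker calls:

```python
from typing import Callable, Set


def collect_unique(
    generate: Callable[[], str],
    max_unique: int = 1_000_000,
    max_iterations: int = 10_000_000,
) -> Set[str]:
    """Call generate() until max_unique distinct values exist or attempts run out."""
    unique: Set[str] = set()
    for _ in range(max_iterations):
        if len(unique) >= max_unique:
            break  # enough distinct numbers collected for this locale
        unique.add(generate())  # duplicates are absorbed by the set
    return unique


def faker_source(locale: str) -> Callable[[], str]:
    """Build a phone-number generator for one locale (requires the faker package)."""
    from faker import Faker  # imported lazily so collect_unique has no dependencies

    fake = Faker(locale)
    return lambda: fake.phone_number()
```

A call such as `collect_unique(faker_source("de_DE"), max_unique=100, max_iterations=1_000)` would then yield up to 100 distinct numbers for one locale. The iteration cap matters because a locale with few number formats may never reach the unique target.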

Note

All phone numbers in this dataset are synthetic and generated for research/testing purposes only. They should not be considered real phone numbers.