8000 GitHub - fengyangsun/find-Pinyin-names: Find Pinyin combinations (e.g. Chinese names, or any other Pinyins of Chinese characters) from a group of strings.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Find Pinyin combinations (e.g. Chinese names, or any other Pinyins of Chinese characters) from a group of strings.

License

Notifications You must be signed in to change notification settings

fengyangsun/find-Pinyin-names

< 10000 script crossorigin="anonymous" defer="defer" type="application/javascript" src="https://github.githubassets.com/assets/ui_packages_code-view-shared_hooks_use-canonical-object_ts-ui_packages_code-view-shared_hooks-6664ce-382869c58728.js">

Repository files navigation

find-Pinyin-names

I think it's easy to read.

Simply call the function 'find_pinyin_names()' with input a list of names, and it will return all the Pinyin style names and non-Pinyin style names. You can change it to only returning Pinyin names, or only indices of the Pinyin names.

For the names that are separated with ",", " ", "-", it works well to identify them. You may also add more regular expressions to make it compatible with more name cases.

The 'pinyin.txt' is a very important database file to be used for matching patterns.

In future, I may try to also include Wade-Giles romanization checking if I got time, so to also recognize e.g. Taiwanese.

Example Output

['a', 'ai', 'an', 'ang', 'ao', 'ba', 'bai', 'ban', 'bang', 'bao', 'bei', 'ben', 'beng', 'bi', 'bian', 'biao', 'bie', 'bin', 'bing', 'bo', 'bu', 'ca', 'cai', 'can', 'cang', 'cao', 'ce', 'cen', 'ceng', 'cha', 'chai', 'chan', 'chang', 'chao', 'che', 'chen', 'cheng', 'chi', 'chong', 'chou', 'chu', 'chua', 'chuai', 'chuan', 'chuang', 'chui', 'chun', 'chuo', 'ci', 'cong', 'cou', 'cu', 'cuan', 'cui', 'cun', 'cuo', 'da', 'dai', 'dan', 'dang', 'dao', 'de', 'dei', 'den', 'deng', 'di', 'dian', 'diang', 'diao', 'die', 'ding', 'diu', 'dong', 'dou', 'du', 'duan', 'dui', 'dun', 'duo', 'e', 'ê', 'ei', 'en', 'eng', 'er', 'fa', 'fan', 'fang', 'fei', 'fen', 'feng', 'fo', 'fou', 'fu', 'ga', 'gai', 'gan', 'gang', 'gao', 'ge', 'gei', 'gen', 'geng', 'gong', 'gou', 'gu', 'gua', 'guai', 'guan', 'guang', 'gui', 'gun', 'guo', 'ha', 'hai', 'han', 'hang', 'hao', 'he', 'hei', 'hen', 'heng', 'hong', 'hou', 'hu', 'hua', 'huai', 'huan', 'huang', 'hui', 'hun', 'huo', 'ji', 'jia', 'jian', 'jiang', 'jiao', 'jie', 'jin', 'jing', 'jiong', 'jiu', 'ju', 'juan', 'jue', 'jun', 'ka', 'kai', 'kan', 'kang', 'kao', 'ke', 'kei', 'ken', 'keng', 'kong', 'kou', 'ku', 'kua', 'kuai', 'kuan', 'kuang', 'kui', 'kun', 'kuo', 'la', 'lai', 'lan', 'lang', 'lao', 'le', 'lei', 'leng', 'li', 'lia', 'lian', 'liang', 'liao', 'lie', 'lin', 'ling', 'liu', 'lo', 'long', 'lou', 'lu', 'lv', 'lyu', 'luan', 'lve', 'lyue', 'lun', 'luo', 'ma', 'mai', 'man', 'mang', 'mao', 'me', 'mei', 'men', 'meng', 'mi', 'mian', 'miao', 'mie', 'min', 'ming', 'miu', 'mo', 'mou', 'mu', 'na', 'nai', 'nan', 'nang', 'nao', 'ne', 'nei', 'nen', 'neng', 'ni', 'nia', 'nian', 'niang', 'niao', 'nie', 'nin', 'ning', 'niu', 'nong', 'nou', 'nu', 'nv', 'nyu', 'nuan', 'nve', 'nyue', 'nun', 'nuo', 'o', 'ou', 'pa', 'pai', 'pan', 'pang', 'pao', 'pei', 'pen', 'peng', 'pi', 'pian', 'piao', 'pie', 'pin', 'ping', 'po', 'pou', 'pu', 'qi', 'qia', 'qian', 'qiang', 'qiao', 'qie', 'qin', 'qing', 'qiong', 'qiu', 'qu', 'quan', 'que', 'qun', 'ran', 'rang', 'rao', 're', 'ren', 'reng', 'ri', 'rong', 'rou', 'ru', 'ruan', 'rui', 'run', 'ruo', 'sa', 'sai', 'san', 'sang', 'sao', 'se', 'sei', 'sen', 'seng', 'sha', 'shai', 'shan', 'shang', 'shao', 'she', 'shei', 'shen', 'sheng', 'shi', 'shong', 'shou', 'shu', 'shua', 'shuai', 'shuan', 'shuang', 'shui', 'shun', 'shuo', 'si', 'song', 'sou', 'su', 'suan', 'sui', 'sun', 'suo', 'ta', 'tai', 'tan', 'tang', 'tao', 'te', 'tei', 'teng', 'ti', 'tian', 'tiao', 'tie', 'ting', 'tong', 'tou', 'tu', 'tuan', 'tui', 'tun', 'tuo', 'wa', 'wai', 'wan', 'wang', 'wei', 'wen', 'weng', 'wo', 'wu', 'xi', 'xia', 'xian', 'xiang', 'xiao', 'xie', 'xin', 'xing', 'xiong', 'xiu', 'xu', 'xuan', 'xue', 'xun', 'ya', 'yan', 'yang', 'yao', 'ye', 'yi', 'yin', 'ying', 'yong', 'you', 'yu', 'yuan', 'yue', 'yun', 'za', 'zai', 'zan', 'zang', 'zao', 'ze', 'zei', 'zen', 'zeng', 'zha', 'zhai', 'zhan', 'zhang', 'zhao', 'zhe', 'zhei', 'zhen', 'zheng', 'zhi', 'zhong', 'zhou', 'zhu', 'zhua', 'zhuai', 'zhuan', 'zhuang', 'zhui', 'zhun', 'zhuo', 'zi', 'zong', 'zou', 'zu', 'zuan', 'zui', 'zun', 'zuo']

Found PinYin names: ['Hua, Youshan', 'Qiansheng Bao', 'Zhenyin Gao', 'Beiyong Fang', 'Yu-lu Zhuo', 'Songxian Gong', 'Chong Gao', 'Shangjiang Cai', 'Qiyou Fan', 'Qiudong Dou', 'Jiaai Xu', 'Caiwei Lin', 'Chengjing Tang', 'Meiying Lei', 'Meiwei Qu', 'Ye Yu', 'Tuogang Jiao', 'Zhoubei Xu', 'Henghu Yin', 'Shanyang Sun', 'Jingtong Wu', 'Jueying You', 'Haishuang Chu', 'Qiangke Yin', 'Huya Yin', 'Qianxiang Yu', 'Biaodui Yi', 'Liangyi Shi', 'Xinyuan Mo', 'Danyuan Chang', 'Shengxian Jing', 'Xiayu Gao', 'Quandi Ji', 'Kaikai Huo', 'Lanwei Lin', 'Zhujuan Lang', 'Xuyou Sheng', 'Yuanchong Qiang', 'Lingqiu Feng', 'Yunzi Shu', 'Taoxin Mu', 'Jinlu Mou', 'Weiqian Shi', 'Feiyi Hua', 'Yanping Hu', 'Lunshuai Hang', 'Yinbao Deng', 'Hairan Wu', 'Yinjin Chu', 'Qianwei Gao', 'Ruijie Ji', 'Si Ban', 'Jingyuan Jiang', 'Hanyi Pan', 'Xingxiao Lei', 'Qiyun Liu', 'Gaolin Yan', 'Weirong Guan', 'Daowei Yan', 'Zheng Zhuo', 'Junchen Long', 'Juxi Lou', 'Mingjin Zhu', 'Jinting Zong', 'Qingwei Ji', 'Wenpan Wang', 'Hai Cui', 'Liqi Zhan', 'Zemin Lv', 'Kunkun Sheng', 'Yingling Qu', 'Xihe Ping', 'Yiliang Cheng', 'Yueqi Feng', 'Yuanqing Teng', 'Kexin Tang', 'Junyun Cui', 'Hanxuan Jin', 'Geyu Fan', 'Han Yan', 'Qidan Ruan', 'Junyue Qian', 'Yigang Bing', 'Zhanjiang Xu', 'Jing Tang', 'Like Zhan', 'Yujing Zhao', 'Jingfei Shi', 'Rongting Bing', 'Zhenyin Wu', 'Ningfeng Su', 'Chuangliang Ning', 'Canping Tang', 'Yiru Guo', 'Geduo Qiu', 'Daofeng Zou', 'Shuwei Wei', 'Zihong Zhen', 'Lingyi Pei', 'Shangneng Wu']

Found Non-PinYin names: ['Tristan Eliza', 'Supplanter Robin', 'Quintana Eva', 'Powerful Doris', 'Eliza Derek', 'Dion Tatum', 'Donald Spring-like', 'Anastasia Elliott', 'Blessed White', 'Ida Lorraine', 'Hayley Perry', 'Matilda Bertina', 'Beloved Daniel', 'Holly Elijah', 'Erskine Nigel', 'Silvery Priscilla', 'Emeline Rosalind', 'Roger Ferguson', 'Fern Delmar', 'Virtuous Dexterous', 'Joan Counsellor', 'Lorena Travers', 'Shelley Louise', 'Juliet Sloane', 'Princess Ophelia', 'Frederica Wilona', 'Ida Quintana', 'Vaughan Shana', 'Felix Anastasia', 'Maisie Lamont', 'Vaughan Blessed', 'Ada Georgette', 'Eighth Noble', 'Beneficient Kyla', 'Perry Ethen', 'Falcon Priscilla', 'Zebadiah Quenna', 'Felix Dermot', 'Guide Roxanne', 'Essence Hope', 'Samantha Kevin', 'Linette Juliet', 'Nora Dennis', 'Herdsman Owen', 'Rosemary Famous', 'Zebediah Joseph', 'Eunice Fawn', 'Wonderful Humble', 'Annabelle Graceful', 'Lee Dennis', 'Konrad Molly', 'Vanessa Laurel', 'Michelle Farley', 'Morris Gerret', 'Grateful Quenna', 'Simon Molly', 'Vivian Anthea', 'Ruby Articulate', 'Tiffany Elga', 'Gemstone Magdalene', 'Joyce Kilian', 'Gaiety Sherwin', 'Anne Trix', 'Edwin Faithful', 'Female Priscilla', 'Talia Hardy', 'Phyllis Trina', 'Ross Lamont', 'Lee Blanche', 'Gardener Mandy', 'Gaiety Elbert', 'Evan Keaton', 'Travers Enoch', 'Vernon Hugo', 'Opal God-like', 'Francis Laurence', 'Lilah Imagine', 'Davin Juliet', 'Dermot Peggy', 'Powerful Shannon', 'Shelley Sarah', 'Geneva Elijah', 'Lucinda Harvester', 'Kyla Mercy', 'Wilona Beloved', 'Geneva Faith', 'Gardener Elfin', 'Lucinda Dorian', 'Graceful Geneva', 'Anita Bridget', 'Dion Brittany', 'Eaton Hugo', 'Robert Esmeralda', 'Peggy Emrick', 'Sadie Hayley', 'Alison Quintana', 'Idelle Eugenia', 'Joey Todd', 'Edric Female', 'White Maisie']

About

Find Pinyin combinations (e.g. Chinese names, or any other Pinyins of Chinese characters) from a group of strings.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0