
N-gram Language Detector in Python

  • Resource size: 126.28 kB
  • Upload date: 2021-06-29
  • Downloads: 0
  • Views: 1
  • Resource points: 1 point
  • Tags: python language NGram use detector

Description

A language detector, as the name suggests, is a program that detects the language of a given piece of text. The system keeps a specific pattern for each language and identifies the language of the input by finding the closest matching pattern. In data analysis, we may need to restrict the data entering the system to a limited set of languages, which is where a language detector comes in handy. The existing language detector available for Python is "oice.langdet", which lacks several features that a standard language detector is expected to have. A few of its limitations: (i) it cannot detect many languages (currently only 3 languages are supported); (ii) it does a "bi-gram" analysis on the input data, which can lead to wrong predictions in some cases (lower accuracy); (iii) it is available only for Python and usable only by Python programs. Shouldn't it be usable
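The "closest matching pattern" idea described above can be sketched with character n-gram profiles compared by a rank-based "out-of-place" distance. This is only a minimal illustration, not the code contained in this archive and not the API of "oice.langdet". The function names, the tiny sample sentences, the choice of 1- to 3-grams, and the `top_k` cutoff are all assumptions made for the example; the three language names simply mirror the entries in the file list.

```python
from collections import Counter

# Hypothetical training sentences; a real detector would use far larger corpora.
SAMPLES = {
    "English": "the quick brown fox jumps over the lazy dog and the cat sleeps in the house",
    "Malay": "saya suka makan nasi goreng dan minum teh di rumah bersama keluarga saya",
    "Spanish": "el perro corre por la calle y la niña come una manzana en la casa",
}


def ngrams(text, n_min=1, n_max=3):
    """Yield character n-grams of length n_min..n_max from each padded word."""
    for word in text.lower().split():
        padded = f"_{word}_"  # underscores mark word boundaries
        for n in range(n_min, n_max + 1):
            for i in range(len(padded) - n + 1):
                yield padded[i:i + n]


def build_profile(text, top_k=300):
    """Map each of the top_k most frequent n-grams to its rank (0 = most frequent)."""
    counts = Counter(ngrams(text))
    # Ties are broken alphabetically so that the ranking is deterministic.
    ranked = sorted(counts, key=lambda g: (-counts[g], g))[:top_k]
    return {gram: rank for rank, gram in enumerate(ranked)}


def out_of_place(doc_profile, lang_profile, max_penalty):
    """Sum of rank differences; n-grams missing from the language get max_penalty."""
    return sum(
        abs(rank - lang_profile[gram]) if gram in lang_profile else max_penalty
        for gram, rank in doc_profile.items()
    )


def detect(text, profiles, top_k=300):
    """Return the language whose profile is closest to the profile of text."""
    doc_profile = build_profile(text, top_k)
    return min(profiles, key=lambda lang: out_of_place(doc_profile, profiles[lang], top_k))


# One profile ("pattern") per language, built once up front.
PROFILES = {lang: build_profile(sample) for lang, sample in SAMPLES.items()}

if __name__ == "__main__":
    print(detect("the dog sleeps in the house", PROFILES))
```

With training text this small the result on unseen input is not reliable; the sketch is meant only to show the mechanics. Mixing n-gram lengths rather than using bi-grams alone gives the comparison more evidence per word, which is the accuracy concern raised in point (ii).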

File List

Langdet
san
ngram_client.py
English
Malay
Spanish
ngram_client.py~
