Hunspell介绍及试用
1、简介
Hunspell是一个为拥有多态和复杂组合词的语言所设计的拼写检查器,原本为匈牙利语设计。
Hunspell是一个自由软件,在GPL、LGPL和MPL三许可证下发行。
Hunspell对主要平台和编程语言都有接口和封装。Hunspell基于MySpell,并且与MySpell词典后端兼容。MySpell使用单字节字符编码,而Hunspell则可以使用Unicode UTF-8编码的词典。
2、以下应用程序使用Hunspell作为拼写检查器:
Mac OS X10.6 以及之后版本
Eclipse,使用Hunspell4Eclipse
Google Chrome,Google开发的一个网页浏览器
Evernote,笔记软件
LibreOffice和OpenOffice.org,开源办公组件
Mozilla Firefox和Thunderbird以及SeaMonkey
Opera,一个跨平台的网页浏览器
Scribus,桌面出版应用
Vim,一个文本编辑器
WPS Office,国产办公组件
3、使用docker镜像测试Hunspell的功能:
3.1查看可用字典
[root@host---- hunspell]# docker run --rm tmaier/hunspell -D
SEARCH PATH:
.::/usr/share/hunspell:/usr/share/myspell:/usr/share/myspell/dicts:/Library/Spelling:/root/.openoffice.org//user/wordbook:/root/.openoffice.org2/user/wordbook:/root/.openoffice.org2./user/w/lib/openoffice.org/basis3./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./shhare/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo
AVAILABLE DICTIONARIES (path is not mandatory for -d option):
/usr/share/hunspell/en_CA
/usr/share/hunspell/de_DE_comb
/usr/share/hunspell/en_ZA
/usr/share/hunspell/en_US
/usr/share/hunspell/en_GB
/usr/share/hunspell/en_AU
/usr/share/hunspell/de_CH
/usr/share/hunspell/de_DE_neu
/usr/share/hunspell/en_NZ
/usr/share/hunspell/de_AT
/usr/share/hunspell/default
LOADED DICTIONARY:
/usr/share/hunspell/default.aff
/usr/share/hunspell/default.dic
Hunspell 1.6.
3.2查看帮助信息
[root@host---- hunspell]# docker run --rm -v $(pwd):/workdir tmaier/hunspell -u3 -i utf- -d de_DE_neu,en_US,de_CH -p words -h
Usage: hunspell [OPTION]... [FILE]...
Check spelling of each FILE. Without FILE, check standard input. - check only first field in lines (delimiter = tabulator)
-a Ispell's pipe interface
--check-url check URLs, e-mail addresses and directory paths
--check-apostrophe check Unicode typographic apostrophe
-d d[,d2,...] use d (d2 etc.) dictionaries
-D show available dictionaries
-G print only correct words or lines
-h, --help display this help and exit
-H HTML input file format
-i enc input encoding
-l print misspelled words(只打印错误的单词)
-L print lines with misspelled words(打印错误单词所在行)
-m analyze the words of the input text
-n nroff/troff input file format
-O OpenDocument (ODF or Flat ODF) input file format
-p dict set dict custom dictionary
-r warn of the potential mistakes (rare words)
-P password set password for encrypted dictionaries
-s stem the words of the input text
-S suffix words of the input text
-t TeX/LaTeX input file format
-v, --version print version number
-vv print Ispell compatible version number
-w print misspelled words (= lines) from one word/line input.
-X XML input file format Example: hunspell -d en_US file.txt # interactive spelling
hunspell -i utf- file.txt # check UTF- encoded file
hunspell -l *.odt # print misspelled words of ODF files # Quick fix of ODF documents by personal dictionary creation # Make a reduced list from misspelled and unknown words: hunspell -l *.odt | sort | uniq >words # Delete misspelled words of the file by a text editor.
# Use this personal dictionary to fix the deleted words: hunspell -p words *.odt Bug reports: http://hunspell.github.io/
3.3检查某个文档的拼写(显示错误词所在行数及建议更改)原文:test1.TXT(链接:https://pan.baidu.com/s/17JRmtnebLblVsMG05CIm-w 密码:l3q9)
[root@host---- hunspell]# docker run --rm -v $(pwd):/workdir tmaier/hunspell -u3 -i utf- -d de_DE_neu,en_US,de_CH -p words test1.TXT
test1.TXT:: Locate: rans | Try: rand
test1.TXT:: Locate: wew | Try: woo
test1.TXT:: Locate: Sevenn | Try: Severn
test1.TXT:: Locate: cannt | Try: canny
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: hasn | Try: has
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: wasn | Try: wans
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: vomeronasal | Try: astronomer
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: weren | Try: were
test1.TXT:: Locate: wasn | Try: wans
test1.TXT:: Locate: wouldn | Try: would
test1.TXT:: Locate: weren | Try: were
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: cefepime | Try: timepiece
test1.TXT:: Locate: amikacin | Try: Kamikaze
test1.TXT:: Locate: Mmm | Try: Mm
test1.TXT:: Locate: kuai | Try: Kauai
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: aren | Try: earn
test1.TXT:: Locate: shouldn | Try: should
test1.TXT:: Locate: whould | Try: would
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Uhh | Try: Shh
test1.TXT:: Locate: Chh | Try: Ch
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: exfoliator | Try: defoliator
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
Hunspell介绍及试用的更多相关文章
- fasttext介绍和试用
http://fasttext.apachecn.org/cn/docs/v0.1.0/support.html fasttext介绍网站 https://github.com/facebookres ...
- firefox插件-HackBar介绍与试用
This toolbar will help you in testing sql injections, XSS holes and site security. It is NOT a tool ...
- Unity4.6新UI系统初探(uGUI)
一.引言 Unity终于在即将到来的4.6版本内集成了所见即所得的UI解决方案(视频).事实上从近几个版本开始,Unity就在为这套系统做技术扩展,以保证最终能实现较理想的UI系统.本文试图通过初步的 ...
- 打造自己博客(wordpress)的wap手机版本
这儿介绍我试用的两款插件:wordpress-mobile-edition和wp-t-wap.1.先说一下两者的区别. wordpress-mobile-edition插件使用后,可以用手机直接访问你 ...
- React中使用CSSTransitionGroup插件实现轮播图
动画效果,是一个页面上必不可少的功能,学习一个新的东西,当然就要学习,如何用新的东西,用它的方法去实现以前的东西啦.今天呢,我就在这里介绍一个试用react-addons-css-transition ...
- 『编程题全队』"Gugua"事务管理系统项目宣传文案
一.项目简介 1.项目简介 Gugua是为了解决有事务管理需要的人群的痛苦, 他们需要 一个便利和高效的个人和团体事务管理平台,但是现有的方案并没有很好地解决这些需求,我们有独特的办法是提供跨平台的软 ...
- javascript中 try catch用法
javascript中 try catch用法 投稿:hebedich 字体:[增加 减小] 类型:转载 时间:2015-08-16我要评论 JS try catch语句一般在什么情况下使用?是必须使 ...
- 活动助手Beta用户试用报告
用户试用报告 1.面向参与者用户 1.1 日常参加各类学习(水综测)活动中,有没有遇到以下问题: (1) 信息来源混乱,不知道靠不靠谱 (2) 每次报名都要重新填写自己的学号手机号,有时候填错了就没综 ...
- 达梦7的试用 与SQLSERVER的简单技术对比
达梦7的试用 与SQLSERVER的简单技术对比 达梦数据库公司推出了他们的数据库服务管理平台,可以在该平台使用达梦数据库而无须安装达梦7数据库 地址:http://online.dameng.com ...
随机推荐
- (66)Wangdao.com第十一天_JavaScript 数组Array
数组 Array 本质上,数组属于一种特殊的对象.typeof 运算符会返回数组的类型是 object 数组的特殊性体现在,它的键名是按次序排列的一组整数(0,1,2...) // Object.ke ...
- elasticsearch之分词插件使用
elasticsearch对英文会拆成单个单词,对中文会拆分成单个字.下面来看看是不是这样. 首先测试一下英文: GET /blog/_analyze { "text": &quo ...
- 2019.4.10 初识puppeteer
注:原地址:https://www.cnblogs.com/paris-test/p/9705075.html 一.Puppeteer 介绍 Puppeteer 翻译是操纵木偶的人,利用这个工具,我们 ...
- pta第一次总结
1).实验代码 include<stdio.h> int main() { int n,i,j,a1,a2,b1,b2,sum,x,k; char c; while(~scanf(&quo ...
- Linux下面使用命令如何运行.sh文件的两种解决办法
Linux下面用命令如何运行.sh文件的方法,有两种方法: 一.直接./加上文件名.sh,如运行hello.sh为./hello.sh[hello.sh必须有x权限] 二.直接sh 加上文件名.sh, ...
- 文本不能被选中的css
-moz-user-select: none; -webkit-user-select: none; -ms-user-select: none; user-select: none;
- [dev] Go的协程切换问题
子标题:runtime.Gosched() 是干嘛用的? 1. go程序都有一个环境变量,做线程数设置 GOMAXPROCS 2. 当协程数小于等于线程数的时候,程序行为上与多线程没有区别. 3. 当 ...
- NABCD---生活日历
本软件的主体为日历.如今人们的生活趋于现代化,大家主要通过手机来查看日期,但我们每一天都有想要做或未完成的事情,也存在对自己有特殊意义的日子.每天人们都会有或大或小的支出与收入,想要系统的记录这些支出 ...
- Vue学习记录第二天
又来做笔记啦,今天又自暴自弃了,还好及时清醒过来了,什么时候努力都不晚,主要是要一直坚持下去,只要坚持就一定会有收获,所有成功得人背后都是付出了巨大得努力的,没有人平白无故的成功.看似光鲜亮丽的背后, ...
- JVM入门到放弃之基本概念
1. 基本概念 jvm 是可运行Java代码的假想计算机,包括一套字节码指令集.一组寄存器.一个栈.一个垃圾回收堆和一个存储方法域. jvm 是运行在操作系统之上的,屏蔽了与具体操作系统平台相关的信息 ...