what are stop words

一、总结

一句话总结:就是在seo的关键词中不要有stop words,不然的话搜索引擎会直接忽略

stop words  most common  words language

In computingstop words are words which are filtered out before or after processing of natural language data (text).[1] Though "stop words" usually refers to the most common words in a language, there is no single universal list of stop words used by all natural language processing tools, and indeed not all tools even use such a list. Some tools specifically avoid removing these stop words to support phrase search.

Any group of words can be chosen as the stop words for a given purpose. For some search engines, these are some of the most common, short function words, such as theisatwhich, and on. In this case, stop words can cause problems when searching for phrases that include them, particularly in names such as "The Who", "The The", or "Take That". Other search engines remove some of the most common words—including lexical words, such as "want"—from a query in order to improve performance.[2]

Hans Peter Luhn, one of the pioneers in information retrieval, is credited with coining the phrase and using the concept.[3] The phrase "stop word", which is not in Luhn's 1959 presentation, and the associated terms "stop list" and "stoplist" appear in the literature shortly afterwards.[4]

A predecessor concept was used in creating some concordances. For example, the first Hebrew concordance, Me’ir nativ, contained a one-page list of unindexed words, with nonsubstantive prepositions and conjunctions which are similar to modern stop words.[5]

In SEO terminology, stop words are the most common words that most search engines avoid, saving space and time in processing large data during crawling or indexing. This helps search engines to save space in their databases.

1、Stop words list?

What exactly are stop words? According to Wikipedia, stop words are the most common words in a language. Since there is no single universal list of stop words available, we’ve created our own list. Learn more about stop words here.

The following list contains most of the stop words used by Yoast SEO and Yoast SEO Premium in English. The full list can be found here.

  • a
  • about
  • above
  • after
  • again
  • against
  • all
  • am
  • an
  • and
  • any
  • are
  • as
  • at
  • be
  • because
  • been
  • before
  • being
  • below
  • between
  • both
  • but
  • by
  • could
  • did
  • do
  • does
  • doing
  • down
  • during
  • each
  • few
  • for
  • from
  • further
  • had
  • has
  • have
  • having
  • he
  • he’d
  • he’ll
  • he’s
  • her
  • here
  • here’s
  • hers
  • herself
  • him
  • himself
  • his
  • how
  • how’s
  • I
  • I’d
  • I’ll
  • I’m
  • I’ve
  • if
  • in
  • into
  • is
  • it
  • it’s
  • its
  • itself
  • let’s
  • me
  • more
  • most
  • my
  • myself
  • nor
  • of
  • on
  • once
  • only
  • or
  • other
  • ought
  • our
  • ours
  • ourselves
  • out
  • over
  • own
  • same
  • she
  • she’d
  • she’ll
  • she’s
  • should
  • so
  • some
  • such
  • than
  • that
  • that’s
  • the
  • their
  • theirs
  • them
  • themselves
  • then
  • there
  • there’s
  • these
  • they
  • they’d
  • they’ll
  • they’re
  • they’ve
  • this
  • those
  • through
  • to
  • too
  • under
  • until
  • up
  • very
  • was
  • we
  • we’d
  • we’ll
  • we’re
  • we’ve
  • were
  • what
  • what’s
  • when
  • when’s
  • where
  • where’s
  • which
  • while
  • who
  • who’s
  • whom
  • why
  • why’s
  • with
  • would
  • you
  • you’d
  • you’ll
  • you’re
  • you’ve
  • your
  • yours
  • yourself
  • yourselves

二、List of stop words

 

随机推荐

  1. 选择排序法、冒泡排序法、插入排序法、系统提供的底层sort方法排序之毫秒级比较

    我的代码: package PlaneGame;/** * 选择排序法.冒泡排序法.插入排序法.系统提供的底层sort方法排序之毫秒级比较 * @author Administrator */impo ...

  2. python之路-day1-if...else...流程判断

    判断输入的用户名:#Author:zww _username = "zww" _password = " username = input("username: ...

  3. 20155201 网络攻防技术 实验五 MSF基础应用

    20155201 网络攻防技术 实验五 MSF基础应用 一.实践内容 一个主动攻击实践,如ms08_067 一个针对浏览器的攻击,如ms11_050 一个针对客户端的攻击,如Adobe 成功应用任何一 ...

  4. Adobe阅读器渗透攻击

    Adobe阅读器渗透攻击 实验前准备 1.两台虚拟机,其中一台为kali,一台为windows xp sp3(老师给的xp虚拟机winxpAttaker,密码:mima1234). 2.设置虚拟机网络 ...

  5. fread和fseek的用法

    原味:http://baike.baidu.com/view/656696.htm    http://baike.baidu.com/view/656689.htm fread 功 能: 从一个流中 ...

  6. Codeforces 827E Rusty String - 快速傅里叶变换 - 暴力

    Grigory loves strings. Recently he found a metal strip on a loft. The strip had length n and consist ...

  7. EGIT

    https://jingyan.baidu.com/article/64d05a0262f013de55f73bcc.html

  8. 配置vim

    VundleVuldle是一个全自动的插件管理器,让我们通过维护插件列表的方式管理插件.它为安装.更新.删除插件提供了方便的命令.在安装Git的情况下(本文不赘述Git的安装),输入命令: git c ...

  9. (zhuan) Speech and Natural Language Processing

    Speech and Natural Language Processing obtain from this link: https://github.com/edobashira/speech-l ...

  10. (转载)C# GDI+ 画简单的图形:直线、矩形、扇形等

    GDI+是一种绘图装置接口, 当拖动窗体是,窗体发生移动,window默认为从窗体移动到另一个地方,先发生擦除后再重新画一个窗体: 而我们自己动手画的图(如下面的线),不会重新画:在属性中,Paint ...