what are stop words

一、总结

一句话总结:就是在seo的关键词中不要有stop words,不然的话搜索引擎会直接忽略

stop words  most common  words language

In computingstop words are words which are filtered out before or after processing of natural language data (text).[1] Though "stop words" usually refers to the most common words in a language, there is no single universal list of stop words used by all natural language processing tools, and indeed not all tools even use such a list. Some tools specifically avoid removing these stop words to support phrase search.

Any group of words can be chosen as the stop words for a given purpose. For some search engines, these are some of the most common, short function words, such as theisatwhich, and on. In this case, stop words can cause problems when searching for phrases that include them, particularly in names such as "The Who", "The The", or "Take That". Other search engines remove some of the most common words—including lexical words, such as "want"—from a query in order to improve performance.[2]

Hans Peter Luhn, one of the pioneers in information retrieval, is credited with coining the phrase and using the concept.[3] The phrase "stop word", which is not in Luhn's 1959 presentation, and the associated terms "stop list" and "stoplist" appear in the literature shortly afterwards.[4]

A predecessor concept was used in creating some concordances. For example, the first Hebrew concordance, Me’ir nativ, contained a one-page list of unindexed words, with nonsubstantive prepositions and conjunctions which are similar to modern stop words.[5]

In SEO terminology, stop words are the most common words that most search engines avoid, saving space and time in processing large data during crawling or indexing. This helps search engines to save space in their databases.

1、Stop words list?

What exactly are stop words? According to Wikipedia, stop words are the most common words in a language. Since there is no single universal list of stop words available, we’ve created our own list. Learn more about stop words here.

The following list contains most of the stop words used by Yoast SEO and Yoast SEO Premium in English. The full list can be found here.

  • a
  • about
  • above
  • after
  • again
  • against
  • all
  • am
  • an
  • and
  • any
  • are
  • as
  • at
  • be
  • because
  • been
  • before
  • being
  • below
  • between
  • both
  • but
  • by
  • could
  • did
  • do
  • does
  • doing
  • down
  • during
  • each
  • few
  • for
  • from
  • further
  • had
  • has
  • have
  • having
  • he
  • he’d
  • he’ll
  • he’s
  • her
  • here
  • here’s
  • hers
  • herself
  • him
  • himself
  • his
  • how
  • how’s
  • I
  • I’d
  • I’ll
  • I’m
  • I’ve
  • if
  • in
  • into
  • is
  • it
  • it’s
  • its
  • itself
  • let’s
  • me
  • more
  • most
  • my
  • myself
  • nor
  • of
  • on
  • once
  • only
  • or
  • other
  • ought
  • our
  • ours
  • ourselves
  • out
  • over
  • own
  • same
  • she
  • she’d
  • she’ll
  • she’s
  • should
  • so
  • some
  • such
  • than
  • that
  • that’s
  • the
  • their
  • theirs
  • them
  • themselves
  • then
  • there
  • there’s
  • these
  • they
  • they’d
  • they’ll
  • they’re
  • they’ve
  • this
  • those
  • through
  • to
  • too
  • under
  • until
  • up
  • very
  • was
  • we
  • we’d
  • we’ll
  • we’re
  • we’ve
  • were
  • what
  • what’s
  • when
  • when’s
  • where
  • where’s
  • which
  • while
  • who
  • who’s
  • whom
  • why
  • why’s
  • with
  • would
  • you
  • you’d
  • you’ll
  • you’re
  • you’ve
  • your
  • yours
  • yourself
  • yourselves

二、List of stop words

 

随机推荐

  1. Iris Classification on Tensorflow

    Iris Classification on Tensorflow Neural Network formula derivation \[ \begin{align} a & = x \cd ...

  2. Android之RadioButton多行

    RadioGroup设置orientation="vertical"竖向单列显示 RadioGroup设置orientation="horizontal"横向单 ...

  3. 指针delete之后赋值为null

    1.现象 经常看到有些代码在delete之后赋值为null 2.原因 C++标准规定:delete空指针是合法的,没有副作用. 所以我们在Delete指针后赋值为NULL或0是个好习惯.对一个非空指针 ...

  4. topcoder srm 689 div1 -3

    1.给出一个$2*n$的矩阵,只包含小写字母.重新排列各个元素使得任意两个相邻的元素不相同? 思路:按照每种字符的数量降序排序,然后从多到少依次放每一种.放的时候一上一下交错放置. #include ...

  5. topcoder srm 714 div1

    problem1 link 倒着想.每次添加一个右括号再添加一个左括号,直到还原.那么每次的右括号的选择范围为当前左括号后面的右括号减去后面已经使用的右括号. problem2 link 令$h(x) ...

  6. C# asp:FileUpload上传文件使用JS实现预览效果

    js代码: <script type="text/javascript"> //下面用于图片上传预览功能 function setImagePreview() { va ...

  7. NLP--- How to install the tool NLTK in Ubuntu ?

    NLP--- How to install the tool NLTK in Ubuntu ? 1. open the website of NLTK and download it.  https: ...

  8. Hadoop【单机安装-测试程序WordCount】

    Hadoop程序说明,就是创建一个文本文件,然后统计这个文本文件中单词出现过多少次! (MapReduce 运行在本地   启动JVM ) 第一步    创建需要的文件目录,然后进入该文件中进行编辑 ...

  9. 【AI】微软人工智能学习笔记(二)

    微软Azure机器学习服务 01|机器学习概述 首先上一张图, 这个图里面的大神是谁我也不清楚反正,但是看起来这句话说得很有哲理就贴出来了. 所以在人工智能领域下面的这个机器学习,到底是一个什么样的概 ...

  10. Google advertiser api开发概述——最佳做法&建议

    最佳做法 本指南介绍了一些最佳做法,您可以运用它们来优化 AdWords API 应用的效率和性能. 日常维护 为确保您的应用不间断运行,可采取以下做法: 确保 AdWords API 中心中的开发者 ...