clearias.com
robots.txt

Robots Exclusion Standard data for clearias.com

Resource Scan

Scan Details

Site Domain clearias.com
Base Domain clearias.com
Scan Status Ok
Last Scan2025-04-05T22:47:39+00:00
Next Scan 2025-04-12T22:47:39+00:00

Last Scan

Scanned2025-04-05T22:47:39+00:00
URL https://clearias.com/robots.txt
Domain IPs 104.21.86.4, 172.67.213.80, 2606:4700:3030::6815:5604, 2606:4700:3031::ac43:d550
Response IP 104.21.86.4
Found Yes
Hash ceb98d849cdb45248e8470a9199ca78cb812127693a83ed309dc94ac3048da54
SimHash 481fc4c38123

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /up/

*

Rule Path
Disallow /?s=
Disallow /search/

mediapartners-google

Rule Path
Disallow

mj12bot

Rule Path
Disallow /

mozilla/5.0 (compatible; ezooms/1.0; ezooms.bot@gmail.com)

Rule Path
Disallow /

mxbot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosoimagespider

Rule Path
Disallow /

rdfbot

Rule Path
Disallow /

camontspider

Rule Path
Disallow /

sosospider+(+http://help.soso.com/webspider.htm)

Rule Path
Disallow /

mozilla/5.0 (compatible; ahrefsbot/4.0; +http://ahrefs.com/robot/)

Rule Path
Disallow /

mozilla/5.0 (compatible; exabot/3.0; +http://www.exabot.com/go/robot)

Rule Path
Disallow /

mozilla/5.0 (compatible; proximic; +http://www.proximic.com/info/spider.php)

Rule Path
Disallow /

rogerbot/1.0 (http://www.seomoz.org/dp/rogerbot, rogerbot-crawler@seomoz.org)

Rule Path
Disallow /

sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm

Product Comment
sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm 07)
Rule Path
Disallow /

mozilla/5.0 (compatible; mail.ru_bot/2.0; +http://go.mail.ru/help/robots)

Rule Path
Disallow /

curl/7.21.0 (x86_64-pc-linux-gnu) libcurl/7.21.0 openssl/0.9.8o zlib/1.2.3.4 libidn/1.15 libssh2/1.2.6

Rule Path
Disallow /

mozilla/5.0 (compatible; genieo/1.0 http://www.genieo.com/webfilter.html)

Rule Path
Disallow /

mozilla/4.0 (compatible;)

Rule Path
Disallow /

mozilla/5.0 (compatible; jikespider; +http://shoulu.jike.com/spider.html)

Rule Path
Disallow /

seznambot/3.0 (+http://fulltext.sblog.cz/)

Rule Path
Disallow /

mozilla/5.0 (compatible; paperlibot/2.1; http://support.paper.li/entries/20023257-what-is-paper-li)

Rule Path
Disallow /

seznambot/3.0 (+http://fulltext.sblog.cz/)

Rule Path
Disallow /

mozilla/5.0 (compatible; sistrix crawler; http://crawler.sistrix.net/)

Rule Path
Disallow /

coccoc/1.0 (http://help.coccoc.vn/)

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.clearias.com/sitemap_index.xml
sitemap https://www.clearias.com/post-sitemap1.xml
sitemap https://www.clearias.com/post-sitemap2.xml
sitemap https://www.clearias.com/page-sitemap.xml
sitemap https://www.clearias.com/category-sitemap.xml
sitemap https://www.clearias.com/post_tag-sitemap.xml
sitemap https://www.clearias.com/video-sitemap.xml

Comments

  • User-Agent: Yandex
  • Disallow: /

Warnings

  • 2 invalid lines.