handspeak.com
robots.txt

Robots Exclusion Standard data for handspeak.com

Resource Scan

Scan Details

Site Domain handspeak.com
Base Domain handspeak.com
Scan Status Ok
Last Scan2024-11-12T03:20:19+00:00
Next Scan 2024-11-19T03:20:19+00:00

Last Scan

Scanned2024-11-12T03:20:19+00:00
URL https://handspeak.com/robots.txt
Redirect https://www.handspeak.com/robots.txt
Redirect Domain www.handspeak.com
Redirect Base handspeak.com
Domain IPs 162.246.254.209
Redirect IPs 162.246.254.209
Response IP 162.246.254.209
Found Yes
Hash 8e6685069b0f71d79fba5638feecf116bb7a318499dffb86f171bb4b6fb2b6af
SimHash 6a562841c1b6

Groups

*

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

googlebot-image

Rule Path
Allow /word/
Allow /learn/

bingbot
googlebot

Rule Path
Allow /
Disallow /member/*

archive.org_bot

Rule Path
Disallow /member

yahoo
duckduckbot

Rule Path
Allow /$
Allow /word/$
Allow /word/*/$
Allow /learn/$
Allow /learn/*/$
Disallow /member/*

doubleverifybot

Rule Path
Allow /
Disallow /member

dvbot

Rule Path
Allow
Disallow /member

leikibot

Rule Path
Allow /
Disallow /member

ias_crawler

Rule Path
Allow /
Disallow /member

ias-

Rule Path
Allow /
Disallow /member

grapeshot

Rule Path
Allow /
Disallow /member

opebot-v (https://www.1plusx.com (https://www.1plusx.com/))

Rule Path
Allow /
Disallow /member

Other Records

Field Value
sitemap https://www.handspeak.com/sitemap.xml

Comments

  • disallow all agents and files
  • allow adsense bot on entire site
  • User-agent: Googlebot
  • Disallow: /nogooglebot/
  • allow google bot on entire site
  • allow google bot on entire site
  • Allow: /
  • allow agents to crawl some
  • Allow DoubleVerifyBot and DVBot for Rapti start
  • Allow Leikibot
  • Allow IAS Crawler (Integral Ad Science)
  • Allow IAS Admantx crawler
  • Allow Grapeshot Crawler
  • Allow https://www.1plusx.com for Rapti END