wabetainfo.com
robots.txt

Robots Exclusion Standard data for wabetainfo.com

Resource Scan

Scan Details

Site Domain wabetainfo.com
Base Domain wabetainfo.com
Scan Status Ok
Last Scan2024-11-17T03:06:14+00:00
Next Scan 2024-11-24T03:06:14+00:00

Last Scan

Scanned2024-11-17T03:06:14+00:00
URL https://wabetainfo.com/robots.txt
Domain IPs 104.21.44.246, 172.67.205.226, 2606:4700:3033::6815:2cf6, 2606:4700:3035::ac43:cde2
Response IP 172.67.205.226
Found Yes
Hash c15722059f35ef115fe6b0e9c00d78bb15488f4b6abe348cffe973f3e6478137
SimHash 497ccad25499

Groups

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

bingbot

Rule Path
Allow /

applebot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

slurp

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

archive.org_bot

Rule Path
Allow /

ia_archiver-web-archive.org

Rule Path
Allow /

*

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://wabetainfo.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK