beingassistant.com
robots.txt

Robots Exclusion Standard data for beingassistant.com

Resource Scan

Scan Details

Site Domain beingassistant.com
Base Domain beingassistant.com
Scan Status Ok
Last Scan2025-10-10T07:04:19+00:00
Next Scan 2025-10-17T07:04:19+00:00

Last Scan

Scanned2025-10-10T07:04:19+00:00
URL https://beingassistant.com/robots.txt
Domain IPs 104.21.65.251, 172.67.196.78, 2606:4700:3036::6815:41fb, 2606:4700:3036::ac43:c44e
Response IP 104.21.65.251
Found Yes
Hash 336ebc51d46adee31421927ebeb26547930de9208273c7d8e6aa7a4b2c543711
SimHash 4a04de8276b6

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

*

Rule Path
Disallow /cdn-cgi/*
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://beingassistant.com/sitemap_index.xml

Comments

  • robots.txt for WordPress Blog
  • Allow all Google bots
  • Block common bad bots
  • General rules for all crawlers
  • Sitemap