thesimpleparent.com
robots.txt

Robots Exclusion Standard data for thesimpleparent.com

Resource Scan

Scan Details

Site Domain thesimpleparent.com
Base Domain thesimpleparent.com
Scan Status Ok
Last Scan 2024-10-04T03:29:35+00:00
Next Scan 2024-10-11T03:29:35+00:00

Last Scan

Scanned 2024-10-04T03:29:35+00:00
URL https://thesimpleparent.com/robots.txt
Domain IPs 104.21.10.21, 172.67.162.33, 2606:4700:3030::ac43:a221, 2606:4700:3033::6815:a15
Response IP 104.21.10.21
Found Yes
Hash 391e2744a4ae8fb0db2acf6a2fb9981163774d7f2fe51f4c4c661b96c286e922
SimHash 59645ac0a593

Groups

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow (empty path; nothing is disallowed)
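
The groups above can be checked with a short sketch using Python's standard urllib.robotparser. The robots.txt text here is rebuilt from the listed groups, so its exact ordering and spacing are assumptions rather than the site's actual file; build_robots_lines, is_allowed, and the "ExampleBot" agent are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# User agents the report lists with "Disallow /".
BLOCKED_AGENTS = [
    "chatgpt-user", "gptbot", "anthropic-ai", "claude-web",
    "piplbot", "ccbot", "facebookbot",
]


def build_robots_lines():
    """Rebuild an approximate robots.txt from the reported groups (assumed layout)."""
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # Catch-all group: an empty Disallow value blocks nothing.
    lines += ["User-agent: *", "Disallow:", ""]
    lines.append("Sitemap: https://thesimpleparent.com/sitemap_index.xml")
    return lines


parser = RobotFileParser()
parser.parse(build_robots_lines())


def is_allowed(agent, url="https://thesimpleparent.com/"):
    """Return True if the rebuilt rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ["GPTBot", "CCBot", "ExampleBot"]:
        print(agent, is_allowed(agent))
```

Under these assumptions, the named crawlers are refused every path, while any agent not listed falls through to the * group and is allowed. Agent names are compared case-insensitively by the parser, which is why "GPTBot" matches the gptbot group.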

Other Records

Field Value
sitemap https://thesimpleparent.com/sitemap_index.xml

Comments

  • ======Raptive Begin======
  • ======Raptive End======
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK