allthingschildcare.com
robots.txt

Robots Exclusion Standard data for allthingschildcare.com

Resource Scan

Scan Details

Site Domain allthingschildcare.com
Base Domain allthingschildcare.com
Scan Status Ok
Last Scan2024-09-23T11:22:37+00:00
Next Scan 2024-09-30T11:22:37+00:00

Last Scan

Scanned2024-09-23T11:22:37+00:00
URL https://allthingschildcare.com/robots.txt
Redirect https://parentporch.com/robots.txt
Redirect Domain parentporch.com
Redirect Base parentporch.com
Domain IPs 104.21.71.199, 172.67.171.157, 2606:4700:3034::ac43:ab9d, 2606:4700:3037::6815:47c7
Redirect IPs 104.21.50.54, 172.67.201.140, 2606:4700:3034::ac43:c98c, 2606:4700:3035::6815:3236
Response IP 104.21.50.54
Found Yes
Hash eb9175617bfd79ac3ecc066b29c4ad6df75a8ade8d5a0765cf74f3446c05f66c
SimHash 49b64840eec3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Disallow /private/

ahrefsbot

Rule Path
Disallow

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

yandexbot

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

*

Rule Path
Disallow /tmp/

Other Records

Field Value
sitemap https://parentporch.com/sitemap_index.xml

Comments

  • Allow all bots to crawl the site
  • Allow AhrefsBot to crawl everything
  • Allow Googlebot to crawl everything
  • Allow Bingbot to crawl everything
  • Allow YandexBot to crawl everything
  • Allow DuckDuckBot to crawl everything
  • Block all bots from crawling a specific folder (example: /tmp/)