Robots Exclusion Standard data for community.whattoexpect.com

Resource Scan

Scan Details

Site Domain community.whattoexpect.com
Base Domain whattoexpect.com
Scan Status Ok
Last Scan 2024-06-15T06:43:51+00:00
Next Scan 2024-06-29T06:43:51+00:00

Last Scan

Scanned 2024-06-15T06:43:51+00:00
URL https://community.whattoexpect.com/robots.txt
Domain IPs 23.54.118.40, 23.54.118.53
Response IP 23.44.4.184
Found Yes
Hash 37b72ff0026f4fb93b6b8045a7a732ffc4f65c09596d90ec2d7fcb5da214629b
SimHash 8d099d54ed90

Groups

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

*

Rule Path
Disallow /Index.aspx?puid=D16F18B4-73A8-48F5-B506-A9A4CE0119FF*
Disallow /Index.aspx?puid=CFB00753-F97C-4587-966D-DE9ECE6BEBDC*
Disallow /baby-pictures/
Disallow /destroy/
Disallow /post/
Disallow /infoheavy.html
Disallow /imageheavy.html
Disallow /?order=
Disallow /?filter=
Disallow /aolcat%3D
Disallow /?RedirectUrl
Disallow /?redirecturl
Disallow /login/
Disallow /apple-app-site-association
Disallow /archives/
Disallow /updateutp*
Disallow /updateUtp*
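
The groups above can be checked programmatically. What follows is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hand-reconstructed subset of the rules listed in this report, not the file as served, and `ExampleBot` is a hypothetical user agent used only for illustration. `urllib.robotparser` matches paths by plain prefix and does not interpret `*` inside a path, so the wildcard and query-string rules are left out of the sketch.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction of the groups in this report,
# limited to simple path-prefix rules.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /baby-pictures/
Disallow: /login/
Disallow: /archives/
"""


def build_parser(text):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://community.whattoexpect.com"

# gptbot has its own group with "Disallow: /", so every path is blocked.
gptbot_forums = parser.can_fetch("GPTBot", BASE + "/forums/")

# ExampleBot (hypothetical) has no group of its own and falls back to "*".
other_login = parser.can_fetch("ExampleBot", BASE + "/login/")
other_forums = parser.can_fetch("ExampleBot", BASE + "/forums/")

print(gptbot_forums, other_login, other_forums)
```

A crawler that is named in its own group is governed only by that group, which is why the `*` rules never apply to gptbot here.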

Other Records

Field Value
sitemap https://www.whattoexpect.com/sitemap.full.xml
sitemap https://community.whattoexpect.com/forums/sitemap.xml
sitemap https://community.whattoexpect.com/posts/sitemap/sitemap.xml
sitemap https://community.whattoexpect.com/posts/sitemap/sitemap-recents.xml
sitemap https://www.whattoexpect.com/news/sitemap.xml
sitemap https://registry.whattoexpect.com/baby-registry/sitemap.xml
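
The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available); `SITEMAP_LINES` reproduces only two of the records above in robots.txt syntax, and the filter by host is an illustrative choice rather than anything the report prescribes.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Assumption: two of the sitemap records from this report, written the
# way they would appear in a robots.txt file.
SITEMAP_LINES = [
    "Sitemap: https://www.whattoexpect.com/sitemap.full.xml",
    "Sitemap: https://community.whattoexpect.com/forums/sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records are present.
sitemaps = sitemap_parser.site_maps() or []

# Keep only the sitemaps hosted on the scanned subdomain.
community_sitemaps = [
    url for url in sitemaps
    if urlparse(url).hostname == "community.whattoexpect.com"
]

print(sitemaps)
print(community_sitemaps)
```

Sitemap records are independent of the user-agent groups, so they are listed regardless of which crawler reads the file.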