whattoexpect.com
robots.txt

Robots Exclusion Standard data for whattoexpect.com

Resource Scan

Scan Details

Site Domain whattoexpect.com
Base Domain whattoexpect.com
Scan Status Ok
Last Scan2024-11-02T06:16:05+00:00
Next Scan 2024-11-09T06:16:05+00:00

Last Scan

Scanned2024-11-02T06:16:05+00:00
URL https://whattoexpect.com/robots.txt
Redirect https://www.whattoexpect.com:443/robots.txt
Redirect Domain www.whattoexpect.com
Redirect Base whattoexpect.com
Domain IPs 52.72.204.96, 54.156.205.43
Redirect IPs 96.17.96.10, 96.17.96.31
Response IP 23.54.118.10
Found Yes
Hash 4ac31a6c6faa9f9520921123605c6a063823314b18d75afc75ab28cff7586ac7
SimHash 8d08cd54e5d2

Groups

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

*

Rule Path
Disallow /Index.aspx?puid=D16F18B4-73A8-48F5-B506-A9A4CE0119FF*
Disallow /Index.aspx?puid=CFB00753-F97C-4587-966D-DE9ECE6BEBDC*
Disallow /baby-pictures/
Disallow /destroy/
Disallow /post/
Disallow /infoheavy.html
Disallow /imageheavy.html
Disallow /?order=
Disallow /?filter=
Disallow /aolcat%3D
Disallow /?RedirectUrl
Disallow /?redirecturl
Disallow /apple-app-site-association
Disallow /archives/
Disallow /updateutp*
Disallow /updateUtp*
Disallow /logoff
Disallow /api/search/
Disallow /register/due-date-calculator/

Other Records

Field Value
sitemap https://www.whattoexpect.com/sitemap.full.xml
sitemap https://community.whattoexpect.com/forums/sitemap.xml
sitemap https://community.whattoexpect.com/posts/sitemap/sitemap.xml
sitemap https://community.whattoexpect.com/posts/sitemap/sitemap-recents.xml
sitemap https://www.whattoexpect.com/news/sitemap.xml
sitemap https://registry.whattoexpect.com/baby-registry/sitemap.xml