claudedaycare.com
robots.txt

Robots Exclusion Standard data for claudedaycare.com

Resource Scan

Scan Details

Site Domain claudedaycare.com
Base Domain claudedaycare.com
Scan Status Ok
Last Scan 2024-09-21T17:20:32+00:00
Next Scan 2024-09-28T17:20:32+00:00

Last Scan

Scanned 2024-09-21T17:20:32+00:00
URL https://www.claudedaycare.com/robots.txt
Domain IPs 142.251.10.121, 2404:6800:4003:c1c::79
Response IP 74.125.130.121
Found Yes
Hash 9b11520169291c4014d68471f65397df82aaf0370e2b76e9f2fe2e0314eccba1
SimHash 98555bb24326

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search*
Disallow /20*
Allow /*.html
Disallow ez3k.lovestoblog.com
Allow /
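To illustrate how the groups above might be evaluated, here is a minimal sketch in Python. It assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and the common `*` and `$` wildcard extensions. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan. The `ez3k.lovestoblog.com` entry is left out because it is not a path pattern beginning with `/`; how a given crawler treats it is not known from this report.

```python
import re

# Rules from the "*" group, in file order. The "ez3k.lovestoblog.com"
# entry is omitted (assumption: it cannot match any URL path).
RULES = [
    ("disallow", "/search*"),
    ("disallow", "/20*"),
    ("allow", "/*.html"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern decides; on equal length, Allow wins.
    With no matching rule, the path is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if not pattern:
            # An empty Disallow (as in the mediapartners-google group)
            # matches nothing, so it restricts nothing.
            continue
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/2024/09/post.html", "/2024/09/", "/search/label/news"]:
        print(p, is_allowed(p))
```

Under these assumptions, post and page URLs ending in `.html` stay crawlable even inside date-prefixed paths, because `/*.html` is longer than `/20*`, while search results and bare archive listings are blocked. Crawlers that apply first-match ordering instead of longest-match would reach different results for the same rules.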

Other Records

Field Value
sitemap https://www.claudedaycare.com/feeds/posts/default?orderby=UPDATED
sitemap https://www.claudedaycare.com/sitemap.xml
sitemap https://www.claudedaycare.com/sitemap-pages.xml
sitemap https://www.claudedaycare.com/atom.xml?redirect=false&start-index=1&max-result=500
sitemap https://www.claudedaycare.com/atom.xml?redirect=false&start-index=501&max-result=500
sitemap https://www.claudedaycare.com/atom.xml?redirect=false&start-index=1001&max-result=500

Comments

  • below lines control all search engines, and blocks all search, archive and allow all blog posts and pages.
  • sitemap of the blog
  • sitemap of the blog - search-console - developers.google.com - ez3k
  • disqus mediapartners disqus.com,