itsthathard.com
robots.txt

Robots Exclusion Standard data for itsthathard.com

Resource Scan

Scan Details

Site Domain itsthathard.com
Base Domain itsthathard.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-17T15:06:06+00:00
Next Scan 2024-11-16T15:06:06+00:00

Last Successful Scan

Scanned2024-07-20T10:57:41+00:00
URL https://www.itsthathard.com/robots.txt
Domain IPs 2404:6800:4003:c0f::79, 74.125.68.121
Response IP 74.125.130.121
Found Yes
Hash 758b8eef725beedbca0214caa3be65ba36c9b57bf17f9ed1f120ddded4385113
SimHash 98154e1345b2

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search*
Disallow /20*
Allow /*.html
Disallow ez3k.lovestoblog.com
Allow /

Other Records

Field Value
sitemap http://www.itsthathard.com/feeds/posts/default?orderby=UPDATED
sitemap https://www.itsthathard.com/sitemap.xml
sitemap https://www.itsthathard.com/sitemap-pages.xml
sitemap https://www.itsthathard.com/atom.xml?redirect=false&start-index=1&max-result=500
sitemap https://www.itsthathard.com/atom.xml?redirect=false&start-index=501&max-result=500
sitemap https://www.itsthathard.com/atom.xml?redirect=false&start-index=1001&max-result=500

Comments

  • below lines control all search engines, and blocks all search, archieve and allow all blog posts and pages.
  • sitemap of the blog
  • sitemap of the blog - search-console - developers.google.com - ez3k