sustainablemaintainance.com
robots.txt

Robots Exclusion Standard data for sustainablemaintainance.com

Resource Scan

Scan Details

Site Domain sustainablemaintainance.com
Base Domain sustainablemaintainance.com
Scan Status Ok
Last Scan 2024-11-16T10:43:01+00:00
Next Scan 2024-11-23T10:43:01+00:00

Last Scan

Scanned 2024-11-16T10:43:01+00:00
URL https://www.sustainablemaintainance.com/robots.txt
Domain IPs 104.21.3.192, 172.67.131.31, 2606:4700:3031::ac43:831f, 2606:4700:3033::6815:3c0
Response IP 104.21.3.192
Found Yes
Hash 472dce73395af5975dea0931e2ced9fd8a62c8834a8c0b88b07f90deaa00a316
SimHash 3a154612c5b0

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search*
Disallow /20*
Allow /*.html
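
How the two groups above apply to a given path can be sketched as follows. This is a minimal illustration, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the rule with the longest matching pattern wins, ties go to Allow), treats `*` as "any sequence of characters", and reads an empty Disallow as "no restriction". The names GROUPS, pattern_to_regex and is_allowed are hypothetical and exist only for this example.

```python
import re

# Rules transcribed from the Groups listing above; an empty path means "no restriction".
GROUPS = {
    "mediapartners-google": [("disallow", "")],
    "*": [
        ("disallow", "/search*"),
        ("disallow", "/20*"),
        ("allow", "/*.html"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, agent="*"):
    """Return True if `path` may be crawled by `agent` under GROUPS.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if not pattern:  # empty rule value: matches nothing
            continue
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/2024/11/post.html", "/2024/11/", "/search?q=x", "/p/about.html"]:
        print(p, is_allowed(p))
```

Under these assumptions, dated archive listings such as /2024/11/ and anything under /search are blocked for general crawlers, while individual posts and pages ending in .html stay crawlable because /*.html is a longer pattern than /20*. A path like /search/label/x.html remains blocked, since /search* is the longer match there.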

Other Records

Field Value
sitemap https://www.sustainablemaintainance.blogspot.com/sitemap.xml
sitemap https://www.sustainablemaintainance.blogspot.com/sitemap-pages.xml

Comments

  • below lines control all search engines, and blocks all search, archieve and allow all blog posts and pages.
  • sitemap of the blog