guruvu.in
robots.txt

Robots Exclusion Standard data for guruvu.in

Resource Scan

Scan Details

Site Domain guruvu.in
Base Domain guruvu.in
Scan Status Ok
Last Scan 2024-09-27T06:04:34+00:00
Next Scan 2024-10-04T06:04:34+00:00

Last Scan

Scanned 2024-09-27T06:04:34+00:00
URL https://guruvu.in/robots.txt
Redirect https://www.guruvu.in/robots.txt
Redirect Domain www.guruvu.in
Redirect Base guruvu.in
Domain IPs 2404:6800:4003:c02::79, 74.125.68.121
Redirect IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Response IP 216.239.36.21
Found Yes
Hash 3a3cc67966ca8738dcdef1a79ba911f644a261a947b0c8eea3779a887cd34716
SimHash 2a3d4e1345b3

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search*
Disallow /20*
Allow /*.html
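
The rules in the "*" group overlap (a dated post such as /2024/09/post.html matches both "Disallow /20*" and "Allow /*.html"), so the outcome depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the crawler follows the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex and is_allowed are hypothetical helpers written for this illustration, and the sample paths are made up; they are not taken from the scanned site.

```python
import re

# Rules for the "*" group, copied from the scan above.
RULES = [
    ("disallow", "/search*"),
    ("disallow", "/20*"),
    ("allow", "/*.html"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled (assumed longest-match precedence)."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty rule path matches nothing
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # The longer pattern wins; on equal length, allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for sample in ["/search?q=x", "/2024/09/", "/2024/09/post.html", "/p/about.html"]:
        print(sample, is_allowed(sample))
```

Under this assumption, search result pages and dated archive listings are blocked, while individual posts and pages ending in .html remain crawlable because "/*.html" (7 characters) is longer than "/20*" (4 characters). A crawler that applies rules in file order rather than by length could decide differently.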

Other Records

Field Value
sitemap https://www.guruvu.in/sitemap.xml
sitemap https://www.guruvu.in/sitemap-pages.xml

Comments

  • below lines control all search engines, and blocks all search, archieve and allow all blog posts and pages.
  • sitemap of the blog