khabriya.in
robots.txt

Robots Exclusion Standard data for khabriya.in

Resource Scan

Scan Details

Site Domain khabriya.in
Base Domain khabriya.in
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-09-24T21:02:51+00:00
Next Scan 2024-10-08T21:02:51+00:00

Last Successful Scan

Scanned2024-08-17T21:00:38+00:00
URL https://khabriya.in/robots.txt
Domain IPs 104.21.234.228, 104.21.234.229, 2606:4700:3038::6815:eae4, 2606:4700:3038::6815:eae5
Response IP 104.21.234.228
Found Yes
Hash 6d5b7d448895d4eb1fffe50ddc5622586a8783fd6d07eae1efe44e5c0c37c167
SimHash 661094104d90

Groups

*

Rule Path
Disallow /?page=*

Other Records

Field Value
sitemap https://khabriya.in/sitemap.xml
sitemap https://khabriya.in/sitemap-1.xml
sitemap https://khabriya.in/sitemap-2.xml
sitemap https://khabriya.in/sitemap-3.xml

Comments

  • https://www.robotstxt.org/robotstxt.html
  • We're experimenting with blocking search results to prevent search result spam