pathwalla.com
robots.txt

Robots Exclusion Standard data for pathwalla.com

Resource Scan

Scan Details

Site Domain pathwalla.com
Base Domain pathwalla.com
Scan Status Ok
Last Scan2025-12-15T14:33:37+00:00
Next Scan 2025-12-22T14:33:37+00:00

Last Scan

Scanned2025-12-15T14:33:37+00:00
URL https://pathwalla.com/robots.txt
Redirect https://www.pathwalla.com/robots.txt
Redirect Domain www.pathwalla.com
Redirect Base pathwalla.com
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 2404:6800:4003:c03::79, 74.125.130.121
Response IP 74.125.68.121
Found Yes
Hash 1965def82b6655387426d4cf4d55ddb3ab621f7450a38a70663dd74cc012a62e
SimHash c9349b81e4b2

Groups

*

Rule Path
Disallow /search
Disallow /category/
Disallow /label/
Disallow /tag/
Disallow /*?m=1
Disallow /*?showComment
Allow /

Other Records

Field Value
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=1&max-results=500
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=501&max-results=500
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=1001&max-results=500
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=1501&max-results=500
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=2001&max-results=500
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=2501&max-results=500
sitemap https://www.pathwalla.com/atom.xml?redirect=false&start-index=3001&max-results=500