capassion.in
robots.txt

Robots Exclusion Standard data for capassion.in

Resource Scan

Scan Details

Site Domain capassion.in
Base Domain capassion.in
Scan Status Ok
Last Scan2024-11-12T00:18:31+00:00
Next Scan 2024-11-19T00:18:31+00:00

Last Scan

Scanned2024-11-12T00:18:31+00:00
URL https://www.capassion.in/robots.txt
Domain IPs 2404:6800:4003:c1a::79, 74.125.200.121
Response IP 74.125.68.121
Found Yes
Hash 36ba9eb1ac472f85b9f8b93dea777ac89b1ed426c5cb73a6f780cb0aced14200
SimHash 2f049f425c33

Groups

*

Rule Path
Disallow /search
Allow /

mediapartners-google
google-display-ads-bot

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.capassion.in/atom.xml?redirect=false&start-index=1&max-results=500
sitemap https://www.capassion.in/atom.xml?redirect=false&start-index=501&max-results=500
sitemap https://www.capassion.in/atom.xml?redirect=false&start-index=1001&max-results=500
sitemap https://www.capassion.in/atom.xml?redirect=false&start-index=1501&max-results=500
sitemap https://www.capassion.in/feeds/posts/default?orderby=UPDATED
sitemap https://www.capassion.in/sitemap.xml