nasscomforstartups.in
robots.txt

Robots Exclusion Standard data for nasscomforstartups.in

Resource Scan

Scan Details

Site Domain nasscomforstartups.in
Base Domain nasscomforstartups.in
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-09-30T07:29:45+00:00
Next Scan 2024-12-29T07:29:45+00:00

Last Successful Scan

Scanned2023-08-28T17:22:10+00:00
URL https://nasscomforstartups.in/robots.txt
Domain IPs 163.172.178.119
Response IP 163.172.178.119
Found Yes
Hash 5559dad219e100852aab3389431e52ae607ddb70fcf72088eab7d1434d833f92
SimHash 0945980169fb

Groups

*

Rule Path
Disallow /pick-institution
Disallow /terms
Disallow /privacy-policy
Disallow /legal
Disallow /backoffice
Disallow /networks/*/recruiter/jobs

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ut-dorkbot

Rule Path
Disallow /

ut-dorkbot/1.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nasscom.hivebrite.com/sitemap.xml