hipeac.net
robots.txt

Robots Exclusion Standard data for hipeac.net

Resource Scan

Scan Details

Site Domain hipeac.net
Base Domain hipeac.net
Scan Status Ok
Last Scan 2024-09-24T18:53:13+00:00
Next Scan 2024-10-24T18:53:13+00:00

Last Scan

Scanned 2024-09-24T18:53:13+00:00
URL https://hipeac.net/robots.txt
Redirect https://www.hipeac.net/robots.txt
Redirect Domain www.hipeac.net
Redirect Base hipeac.net
Domain IPs 104.21.17.78, 172.67.175.76, 2606:4700:3031::6815:114e, 2606:4700:3031::ac43:af4c
Redirect IPs 104.21.17.78, 172.67.175.76, 2606:4700:3031::6815:114e, 2606:4700:3031::ac43:af4c
Response IP 104.21.17.78
Found Yes
Hash 5d3bdfd812672fdcf283ee369499ab37f61521cf088d472a00dbda95ffe94fc2
SimHash 48106931d4b7

Groups

*

Rule Path
Disallow /

googlebot
msnbot
slurp
yahoo-blogs
linkedinbot
twitterbot
facebot

Rule Path
Disallow /accounts/
Disallow /admin/
Disallow /api/
Disallow /assets/private/
Disallow /ec/
Disallow /sc/
Disallow /sympa/
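
The two groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the robots.txt text below is reconstructed from the listed groups and is an assumption, not the verbatim file (the original's ordering, comments and exact spelling may differ). The crawl-delay record is left out because the report does not say which group, if any, it belongs to. The user agent "examplebot" is a hypothetical name for any crawler that is not listed.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the scanned rules (assumed layout, not the verbatim file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /

User-agent: googlebot
User-agent: msnbot
User-agent: slurp
User-agent: yahoo-blogs
User-agent: linkedinbot
User-agent: twitterbot
User-agent: facebot
Disallow: /accounts/
Disallow: /admin/
Disallow: /api/
Disallow: /assets/private/
Disallow: /ec/
Disallow: /sc/
Disallow: /sympa/
"""

rp = RobotFileParser()
rp.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.hipeac.net"

# A listed bot may fetch anything outside the disallowed prefixes.
print(rp.can_fetch("googlebot", BASE + "/"))        # True
print(rp.can_fetch("googlebot", BASE + "/admin/"))  # False

# Any unlisted bot falls back to the "*" group, which disallows everything.
print(rp.can_fetch("examplebot", BASE + "/"))       # False
```

A specific group takes precedence over the "*" group for the agents it names, which is why the listed bots are not affected by the blanket Disallow.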

Other Records

Field Value
crawl-delay 600

Comments

  • https://support.google.com/webmasters/answer/6062608?hl=en
  • Disallow all
  • But allow only some bots