netapp-prod.mindtouch.us
robots.txt

Robots Exclusion Standard data for netapp-prod.mindtouch.us

Resource Scan

Scan Details

Site Domain netapp-prod.mindtouch.us
Base Domain mindtouch.us
Scan Status Ok
Last Scan2024-09-23T10:42:26+00:00
Next Scan 2024-10-07T10:42:26+00:00

Last Scan

Scanned2024-09-23T10:42:26+00:00
URL https://netapp-prod.mindtouch.us/robots.txt
Redirect https://kb.netapp.com/robots.txt
Redirect Domain kb.netapp.com
Redirect Base netapp.com
Domain IPs 18.155.68.104, 18.155.68.17, 18.155.68.21, 18.155.68.31
Redirect IPs 23.50.86.148, 2600:1413:b000:386::3407, 2600:1413:b000:39a::3407
Response IP 23.66.34.6
Found Yes
Hash 216170c347b09d304ff65c98e9975e00201149f3ed6deadae196927e34505028
SimHash 2d2ce6b24f31

Groups

*

Rule Path
Allow /%40api/deki/files/
Allow /%40api/deki/users/authenticate
Allow /Special%3AUserLogin
Allow /*title%3DSpecial%3AUserLogin
Allow /%40app/auth/
Allow /%40app/saml/
Disallow /Special%3A*
Disallow /*title%3DSpecial%3A*
Disallow /Template%3A*
Disallow /*title%3DTemplate%3A*
Disallow /User%3A*
Disallow /*title%3DUser%3A*
Disallow /deki/
Disallow /*action%3D*
Disallow /%40*

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://kb.netapp.com/sitemap.xml

Comments

  • allow file attachments
  • allow GSA authentication
  • block operational (non content) locations
  • block GPT bot

Warnings

  • `request-rate` is not a known field.