docs.splunk.com
robots.txt

Robots Exclusion Standard data for docs.splunk.com

Resource Scan

Scan Details

Site Domain docs.splunk.com
Base Domain splunk.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-06T23:59:27+00:00
Next Scan 2025-12-05T23:59:27+00:00

Last Successful Scan

Scanned2025-07-16T21:42:48+00:00
URL https://docs.splunk.com/robots.txt
Domain IPs 52.40.190.39, 52.42.47.151
Response IP 52.42.47.151
Found Yes
Hash 6ff3ee74a0c0512e2cec23f516229b998ceb81089daa3221219e60f400d60756
SimHash 2810613a0ef1

Groups

gsa-crawler-splunk

Rule Path Comment
Disallow /index.php Do not allow index.php with any parameters after it

lucidworks-splkext

Rule Path Comment
Disallow /index.php Do not allow index.php with any parameters after it

lucidworks-splkent

Rule Path Comment
Disallow /index.php Do not allow index.php with any parameters after it

*

Rule Path Comment
Disallow /*?* exclude any URL with a query string
Disallow /index.php Do not allow index.php with any parameters after it
Disallow /skins/ No need to index the skins directory
Disallow /Special%3A Do not index Special pages
Disallow /Category%3A No need to index Category page listings
Disallow /Documentation%3A No need to index pages like Documentation:Versions and Documentation:Manuals
Disallow /Documentation_talk%3A No need to index talk pages, we use comments now
Disallow /User%3A No need to index user space
Disallow /File%3A No need to index File namespace
Disallow /images/*.pdf No need to index uploaded pdf files
Disallow /Documentation/DSP/* -

Comments

  • Splunk Documentation