iisc-institute.de
robots.txt

Robots Exclusion Standard data for iisc-institute.de

Resource Scan

Scan Details

Site Domain iisc-institute.de
Base Domain iisc-institute.de
Scan Status Ok
Last Scan2025-08-26T11:42:55+00:00
Next Scan 2025-09-25T11:42:55+00:00

Last Scan

Scanned2025-08-26T11:42:55+00:00
URL https://iisc-institute.de/robots.txt
Domain IPs 2a01:238:20a:202:1165::, 81.169.145.165
Response IP 81.169.145.165
Found Yes
Hash 33e6007181b71c9e4ccb1763c60b3b64e3239dd7e8b3d9d2ab8a0a7d87f0e035
SimHash 2711c614efb3

Groups

*

Rule Path
Disallow *?gtm
Disallow *?utm
Disallow *?gclid
Disallow *?from
Disallow *s%3D
Disallow *p%3D
Disallow /wp-json/
Disallow /wp-content/
Disallow /payment
Disallow /zahlung
Allow /wp-content/*.jpeg
Allow /wp-content/*.jpg
Allow /wp-content/*.png
Allow /wp-content/*.js
Allow /wp-content/*.css
Allow /wp-content/*.woff

Other Records

Field Value
sitemap https://iisc-institute.de/sitemap_index.xml

Warnings

  • `host` is not a known field.