cdc.unsri.ac.id
robots.txt

Robots Exclusion Standard data for cdc.unsri.ac.id

Resource Scan

Scan Details

Site Domain cdc.unsri.ac.id
Base Domain unsri.ac.id
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2025-03-29T08:26:52+00:00
Next Scan 2025-04-05T08:26:52+00:00

Last Successful Scan

Scanned2025-03-14T04:11:45+00:00
URL https://cdc.unsri.ac.id/robots.txt
Domain IPs 103.121.159.35
Response IP 103.121.159.35
Found Yes
Hash 6adbdb5628ffb9c7634ab03ab7f29bd12faa5e0c1a6e3a309351cff44aeca16a
SimHash b94abf20e320

Groups

*

Rule Path
Allow /
Disallow /assets/
Disallow /images/
Disallow /public/
Disallow /themes/
Disallow /.well-known/
Disallow /data/
Disallow /externals/
Disallow /media/
Disallow /public/

Comments

  • Disallowed Sub-Directories