Robots Exclusion Standard data for usm.ac.id

Resource Scan

Scan Details

Site Domain usm.ac.id
Base Domain usm.ac.id
Scan Status Ok
Last Scan 2025-04-04T22:45:28+00:00
Next Scan 2025-05-04T22:45:28+00:00
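
The two timestamps above suggest a fixed rescan interval. As a quick check, the sketch below parses both values and computes the gap between them. That the scanner schedules rescans on a fixed interval is an assumption drawn only from these two values, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the "Last Scan" and "Next Scan" fields above.
last_scan = datetime.fromisoformat("2025-04-04T22:45:28+00:00")
next_scan = datetime.fromisoformat("2025-05-04T22:45:28+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)  # number of whole days between the scans
```

For these two values the gap works out to exactly 30 days.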

Last Scan

Scanned 2025-04-04T22:45:28+00:00
URL https://usm.ac.id/robots.txt
Domain IPs 103.134.215.11
Response IP 103.134.215.11
Found Yes
Hash 593053a997d82be7e65a4540aa44668c76894d8dadfcdf1579fe8e6571e1a25d
SimHash 6b0c48400f12

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/uploads/wpo/wpo-plugins-tables-list.json
Disallow /wp-content/plugins/
Disallow /wp-includes/
Disallow /id/?*
Disallow /en/?*

bingbot

Rule Path
Disallow /
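
To illustrate how the two groups above would be evaluated, here is a minimal sketch of the longest-match rule described in RFC 9309: the most specific (longest) matching pattern wins, and Allow wins a tie. The rules are transcribed from the listing above; the names GROUPS, pattern_to_regex, and is_allowed are hypothetical helpers, and user-agent selection is simplified to an exact, case-insensitive lookup with a fallback to the * group. Real crawlers may differ in details.

```python
import re

# Rules transcribed from the "*" and "bingbot" groups in this report.
GROUPS = {
    "*": [
        ("disallow", "/wp-admin/"),
        ("allow", "/wp-admin/admin-ajax.php"),
        ("disallow", "/wp-content/uploads/wpo/wpo-plugins-tables-list.json"),
        ("disallow", "/wp-content/plugins/"),
        ("disallow", "/wp-includes/"),
        ("disallow", "/id/?*"),
        ("disallow", "/en/?*"),
    ],
    "bingbot": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters, a trailing '$' anchors the end,
    and everything else (including '?') is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` (simplified)."""
    # Assumption: exact group-name lookup, falling back to the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); the larger tuple wins
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


for agent, path in [
    ("ExampleBot", "/wp-admin/"),
    ("ExampleBot", "/wp-admin/admin-ajax.php"),
    ("ExampleBot", "/id/?p=1"),
    ("ExampleBot", "/id/"),
    ("bingbot", "/wp-admin/admin-ajax.php"),
]:
    print(agent, path, is_allowed(agent, path))
```

Under this reading, /wp-admin/admin-ajax.php stays fetchable for crawlers in the * group because the Allow pattern is longer than Disallow /wp-admin/, while bingbot is blocked from every path because it uses only its own group. Note that /id/?* and /en/?* match only URLs with a literal ? after the language prefix, so /id/ itself is not blocked.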

Other Records

Field Value
sitemap https://usm.ac.id/sitemap.xml
sitemap https://usm.ac.id/sitemap.rss