marcelohorta.com
robots.txt
Robots Exclusion Standard data for marcelohorta.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | marcelohorta.com |
| Base Domain | marcelohorta.com |
| Scan Status | Ok |
| Last Scan | 2025-12-22T23:10:31+00:00 |
| Next Scan | 2026-01-21T23:10:31+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-12-22T23:10:31+00:00 |
| URL | https://marcelohorta.com/robots.txt |
| Domain IPs | 104.21.6.154, 172.67.134.251, 2606:4700:3033::6815:69a, 2606:4700:3037::ac43:86fb |
| Response IP | 104.21.6.154 |
| Found | Yes |
| Hash | dae10a54684e6da3104c23b85ed6f36daea7bf8af25d4ebd8a4d231a2c87075c |
| SimHash | d9e1a8008d93 |
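The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say whether it is computed over the raw response bytes. Under those two assumptions, a minimal sketch for checking a locally saved copy of the file against the reported value could look like this (`sha256_hex` and `REPORTED_HASH` are illustrative names, not part of the report):

```python
import hashlib

# Value copied from the Hash row above. That it is a SHA-256 digest of the
# raw robots.txt bytes is an assumption, not something the report states.
REPORTED_HASH = "dae10a54684e6da3104c23b85ed6f36daea7bf8af25d4ebd8a4d231a2c87075c"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of `body` as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against the reported hash."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the scanner normalizes the content or uses a different algorithm.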
Groups
*
| Rule | Path | Comment |
|---|---|---|
| Disallow | /pdfs/ | Block the /pdfs/ directory |
| Disallow | *.pdf$ | Block pdf files from all bots. Albeit non-standard, it works for major search engines |
| Disallow | /wp-admin/ | - |
| Allow | /wp-admin/admin-ajax.php | - |
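How these four rules interact can be sketched in code. The example below assumes the matching behavior described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie. Individual crawlers may differ in detail, and Python's standard `urllib.robotparser` does not handle these wildcards, so the matcher is hand-written; `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative names.

```python
import re

# Rules from the `*` group in the table above, as (directive, pattern) pairs.
RULES = [
    ("disallow", "/pdfs/"),
    ("disallow", "*.pdf$"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        # re.match anchors at the start of the path.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because the `Allow` pattern is longer than `/wp-admin/`, while `/docs/file.pdf?x=1` is not caught by `*.pdf$` because the `$` anchor requires the path to end in `.pdf`.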
Other Records
| Field | Value |
|---|---|
| sitemap | https://marcelohorta.com/sitemap_index.xml |
Comments