Robots Exclusion Standard data for gakuranman.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | gakuranman.com |
| Base Domain | gakuranman.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Couldn't connect to server. |
| Last Scan | 2025-12-08T21:16:31+00:00 |
| Next Scan | 2026-03-08T21:16:31+00:00 |
Last Successful Scan
| Field | Value |
|---|---|
| Scanned | 2024-01-26T05:09:25+00:00 |
| URL | https://gakuranman.com/robots.txt |
| Domain IPs | 192.185.48.233 |
| Response IP | 192.185.48.233 |
| Found | Yes |
| Hash | 0c1f96850bb64bff02d1d0270c2d6bd61bfcd5c68ae6ab81a0cbc4061938929a |
| SimHash | 69204963ee99 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /feed/ |
| Disallow | /cgi-bin/ |
| Disallow | /wp-admin/ |
| Disallow | /wp-content/ |
| Disallow | /wp-includes/ |
| Disallow | /trackback/ |
| Disallow | /go/ |
| Disallow | /out/ |
| Allow | /wp-content/uploads/ |
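
The group above disallows `/wp-content/` but allows `/wp-content/uploads/`, so the outcome for a given URL depends on how a crawler resolves overlapping rules. The following is a minimal sketch, not the scanner's own logic, that evaluates the listed rules under longest-match precedence (the most specific matching path wins, and `Allow` wins a tie), which is the behavior described in RFC 9309. The names `RULES` and `is_allowed` are hypothetical, and the sketch assumes plain prefix matching with no wildcards, since none appear in the table.

```python
# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("disallow", "/feed/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/trackback/"),
    ("disallow", "/go/"),
    ("disallow", "/out/"),
    ("allow", "/wp-content/uploads/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: rules are simple path prefixes (no '*' or '$' patterns).
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if path.startswith(prefix):
            n = len(prefix)
            # The longer prefix wins; on equal length, Allow wins.
            if n > best_len or (n == best_len and kind == "allow"):
                best_len = n
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/wp-content/uploads/photo.jpg",
              "/wp-content/themes/style.css",
              "/feed/",
              "/about/"]:
        print(p, is_allowed(p))
```

Under this precedence `/wp-content/uploads/photo.jpg` is crawlable while the rest of `/wp-content/` is not. A parser that instead applies rules in file order would block the uploads path, because the `Disallow: /wp-content/` rule is listed first.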
Other Records
| Field | Value |
|---|---|
| sitemap | http://gakuran.com/sitemap.xml |
Warnings
- 2 invalid lines.