joshibujapan.com
robots.txt
Robots Exclusion Standard data for joshibujapan.com
Resource Scan
Scan Details
Site Domain | joshibujapan.com |
Base Domain | joshibujapan.com |
Scan Status | Ok |
Last Scan | 2025-10-18T16:38:30+00:00 |
Next Scan | 2025-11-01T16:38:30+00:00 |
Last Scan
Scanned | 2025-10-18T16:38:30+00:00 |
URL | https://joshibujapan.com/robots.txt |
Domain IPs | 18.176.100.108 |
Response IP | 18.176.100.108 |
Found | Yes |
Hash | dc2413c4db3c8991c19e6a1c6801367728aa7b48ea8c61dbc8aea34c7469f1a1 |
SimHash | e11dc8e2c3d1 |
Groups
*
Rule | Path |
---|---|
Allow | /n/* |
Allow | /m/* |
Allow | /p/* |
Allow | /archives/* |
Allow | /followings |
Allow | /followers |
Allow | /likes |
Allow | /membership/* |
Allow | /sitemap.xml.gz |
Disallow | /*/ |
Disallow | /embed/* |
Disallow | /intent/* |
Disallow | /m/*/archive |
Other Records
Field | Value |
---|---|
sitemap | https://joshibujapan.com/sitemap.xml.gz |