harappa.or.jp
robots.txt

Robots Exclusion Standard data for harappa.or.jp

Resource Scan

Scan Details

Site Domain harappa.or.jp
Base Domain harappa.or.jp
Scan Status Ok
Last Scan 2026-02-26T02:56:59+00:00
Next Scan 2026-03-12T02:56:59+00:00

Last Scan

Scanned 2026-02-26T02:56:59+00:00
URL https://harappa.or.jp/robots.txt
Domain IPs 85.131.207.92
Response IP 85.131.207.92
Found Yes
Hash fdc91eca9111eeacf8a85fc4f532c70510e63c7bbf312393c5108d923c24dbc8
SimHash 69009820c1b3
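
The Hash value above is 64 hexadecimal digits, the same length as a SHA-256 digest. The report does not say how it is computed, so the following is only a sketch under the assumption that it is SHA-256 over the raw bytes of the robots.txt response body. The helper names `sha256_hex` and `matches_recorded` are hypothetical.

```python
import hashlib

# Hash value as recorded in the scan above.
RECORDED_HASH = "fdc91eca9111eeacf8a85fc4f532c70510e63c7bbf312393c5108d923c24dbc8"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes) -> bool:
    """Compare a fetched body against the recorded hash.

    ASSUMPTION: the scanner hashes the unmodified response body with SHA-256.
    If it normalizes the content first, this comparison will not match.
    """
    return sha256_hex(body) == RECORDED_HASH
```

If the assumption holds, passing the bytes fetched from the URL listed above to `matches_recorded` would indicate whether the file has changed since the scan.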

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

gptbot

Rule Path
Disallow /
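
The two groups above can be read as follows: a crawler first picks the group that names it (falling back to `*`), then applies the longest matching path rule, with Allow winning a tie. The sketch below illustrates that evaluation for the rules listed in this scan. It is a simplified illustration, not a full parser: it assumes plain prefix matching (none of the listed paths use wildcards) and matches user agents by case-insensitive substring. The names `Rule`, `GROUPS`, `select_group`, and `is_allowed` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    allow: bool  # True for Allow, False for Disallow
    path: str    # path prefix the rule applies to


# Groups exactly as listed in the scan.
GROUPS = {
    "*": [
        Rule(False, "/wp-admin/"),
        Rule(True, "/wp-admin/admin-ajax.php"),
    ],
    "gptbot": [
        Rule(False, "/"),
    ],
}


def select_group(user_agent: str) -> list[Rule]:
    """Pick the group naming this crawler, else the catch-all '*' group.

    ASSUMPTION: simple case-insensitive substring match on the user agent.
    """
    token = user_agent.lower()
    for name, rules in GROUPS.items():
        if name != "*" and name in token:
            return rules
    return GROUPS["*"]


def is_allowed(user_agent: str, path: str) -> bool:
    """Apply the longest matching rule; Allow wins when lengths tie."""
    matches = [r for r in select_group(user_agent) if path.startswith(r.path)]
    if not matches:
        return True  # no rule matches, so the path is allowed by default
    best = max(matches, key=lambda r: (len(r.path), r.allow))
    return best.allow


print(is_allowed("Googlebot", "/wp-admin/admin-ajax.php"))  # True
print(is_allowed("Googlebot", "/wp-admin/options.php"))     # False
print(is_allowed("GPTBot/1.0", "/"))                        # False
```

Longest-match evaluation is used here because a first-match evaluator would hit `Disallow /wp-admin/` before reaching the more specific Allow rule for `admin-ajax.php`.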

Other Records

Field Value
sitemap https://harappa.or.jp/wp-sitemap.xml