rupahuq.org.uk
robots.txt
Robots Exclusion Standard data for rupahuq.org.uk
Resource Scan
Scan Details
Site Domain | rupahuq.org.uk |
Base Domain | rupahuq.org.uk |
Scan Status | Ok |
Last Scan | 2024-09-14T15:28:52+00:00 |
Next Scan | 2024-10-14T15:28:52+00:00 |
Last Scan
Scanned | 2024-09-14T15:28:52+00:00 |
URL | https://www.rupahuq.org.uk/robots.txt |
Domain IPs | 35.189.79.96 |
Response IP | 35.189.79.96 |
Found | Yes |
Hash | 7c9e6f2b9a4248e7a265a439bf99087a7575c50b9dad799333ebcbd8a5df45cc |
SimHash | 8e57cc209cd7 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-json/ |
*
Rule | Path |
---|---|
Disallow | /feed/ |
*
Rule | Path |
---|---|
Disallow | /?author=1 |
*
Rule | Path |
---|---|
Disallow | /wp-content/themes/labour-multi-site-theme/manifest.json |
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
Rule | Path |
---|---|
Disallow | / |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Comments