papkassen.dk
robots.txt

Robots Exclusion Standard data for papkassen.dk

Resource Scan

Scan Details

Site Domain papkassen.dk
Base Domain papkassen.dk
Scan Status Ok
Last Scan 2024-10-04T07:02:27+00:00
Next Scan 2024-10-11T07:02:27+00:00

Last Scan

Scanned 2024-10-04T07:02:27+00:00
URL https://papkassen.dk/robots.txt
Redirect https://www.papkassen.dk/robots.txt
Redirect Domain www.papkassen.dk
Redirect Base papkassen.dk
Domain IPs 18.184.117.135
Redirect IPs 18.184.117.135
Response IP 18.184.117.135
Found Yes
Hash 42deb5ba76ecd05d28a5d475f5b55617295421bff70022faff4a0a0f91fdf4d7
SimHash 781ec962ce11

Groups

*

Rule Path
Disallow /config/
Disallow /controllers/
Disallow /includes/
Disallow /logs/
Disallow /tools/
Disallow /views/
Disallow /redirect/

blexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

vege bot

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

slurp

No rules defined. All paths allowed.
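
To illustrate how the groups above apply to a given crawler, the following is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is a reconstruction from the groups listed in this report, not the file actually served by the site. The empty Disallow line for slurp is an assumption about how a rule-less group could be written, and the agent name "examplebot" and the path "/config/app.ini" are hypothetical. The crawl-delay record is left out because the report does not say which group it belongs to.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction from the groups listed in this report; NOT the site's actual file.
# Assumption: the rule-less "slurp" group is expressed as an empty Disallow.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow: /config/
Disallow: /controllers/
Disallow: /includes/
Disallow: /logs/
Disallow: /tools/
Disallow: /views/
Disallow: /redirect/

User-agent: blexbot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: semrushbot-sa
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: slurp
Disallow:
"""

# Host taken from the redirect target shown in the scan details.
BASE = "https://www.papkassen.dk"

# Parse the reconstructed text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(RECONSTRUCTED_ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "examplebot" and "/config/app.ini" are hypothetical, for illustration only.
    for agent, path in [
        ("examplebot", "/"),
        ("examplebot", "/config/app.ini"),
        ("semrushbot", "/"),
        ("slurp", "/config/app.ini"),
    ]:
        print(agent, path, allowed(agent, path))
```

A crawler with its own named group is matched against that group only, so slurp is not bound by the rules in the * group. Any agent without a named group falls back to *. Note that urllib.robotparser matches agent names by substring, which may differ from how other parsers select a group.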

Other Records

Field Value
crawl-delay 1