maprova.dk
robots.txt

Robots Exclusion Standard data for maprova.dk

Resource Scan

Scan Details

Site Domain maprova.dk
Base Domain maprova.dk
Scan Status Ok
Last Scan2025-05-08T23:29:10+00:00
Next Scan 2025-06-07T23:29:10+00:00

Last Scan

Scanned2025-05-08T23:29:10+00:00
URL https://maprova.dk/robots.txt
Domain IPs 94.231.103.172
Response IP 94.231.103.172
Found Yes
Hash 4dfb528984ba983957999d303b918fdad670407e40b93411dc16a7d1923b4691
SimHash 050e09f3c6d7

Groups

*

Rule Path
Disallow /actions/
Disallow /private/
Disallow /.ftp-deploy-sync-state.json
Disallow /error.html
Disallow /404.html
Disallow /cgi-bin/
Disallow /tmp/
Disallow /admin/
Disallow /config/

*

Rule Path
Disallow /*.php$
Disallow /*.log$
Disallow /*.env$
Allow /*.json$
Allow /

Other Records

Field Value
sitemap https://maprova.dk/page-sitemap.xml

Comments

  • robots.txt for maprova.dk
  • Allow all bots to access the main website
  • Block specific file types from being crawled
  • Allow access to the sitemap
  • Allow crawling of important resources