holdsport.dk
robots.txt

Robots Exclusion Standard data for holdsport.dk

Resource Scan

Scan Details

Site Domain holdsport.dk
Base Domain holdsport.dk
Scan Status Ok
Last Scan 2024-05-19T10:13:26+00:00
Next Scan 2024-05-26T10:13:26+00:00

Last Scan

Scanned 2024-05-19T10:13:26+00:00
URL https://holdsport.dk/robots.txt
Domain IPs 104.26.8.15, 104.26.9.15, 172.67.69.131, 2606:4700:20::681a:80f, 2606:4700:20::681a:90f, 2606:4700:20::ac43:4583
Response IP 104.26.9.15
Found Yes
Hash d686469bd4bb5277eb174cf728a287ac1d5707e89670b751b20d0de2a4f8908f
SimHash 8c72f7789955

Groups

*

Rule Path
Disallow /*_tid%3D*
Disallow /*.pdf*
Disallow /*.sfw%3D*
Disallow /*.xls%3D*
Disallow /*.doc%3D*
Disallow /3rdpartad.html
Disallow /clubs/new
Disallow /php/
Disallow /fb
Disallow /fb.mobile
Disallow /top_iframe_isense_dotdk.html
Disallow /klub/*
Disallow /hold/*
Disallow /sign_in/*
Disallow /category_products/*
Disallow /*.xls
Disallow /*.xlsx
Disallow /*.doc
Disallow /*.docx
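
To illustrate how the rules above might be applied, here is a minimal sketch of wildcard matching in the style commonly used by major crawlers: `*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match from the start of the path. The names `rule_to_regex` and `is_disallowed` are hypothetical helpers, not part of any library, and the sketch does not normalize percent-encoding (so `%3D` is compared literally) or handle `Allow` precedence.

```python
import re

# Disallow paths for the "*" group, copied from the table above.
DISALLOW_RULES = [
    "/*_tid%3D*", "/*.pdf*", "/*.sfw%3D*", "/*.xls%3D*", "/*.doc%3D*",
    "/3rdpartad.html", "/clubs/new", "/php/", "/fb", "/fb.mobile",
    "/top_iframe_isense_dotdk.html", "/klub/*", "/hold/*", "/sign_in/*",
    "/category_products/*", "/*.xls", "/*.xlsx", "/*.doc", "/*.docx",
]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex (assumed semantics)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces and join them with ".*" wherever the rule had "*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches the path from its first character."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

if __name__ == "__main__":
    for candidate in ["/klub/123", "/docs/report.pdf", "/fb.mobile", "/about", "/"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/fb` also covers `/fb.mobile` by prefix, and `/*.xls` also covers `.xlsx` files, so some of the listed rules are redundant rather than conflicting.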

mediapartners-google

Rule Path
Disallow

Comments

  • For Google AdSense to crawl the page
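
An empty Disallow value means nothing is disallowed for that group, so the mediapartners-google crawler may fetch every path even though the `*` group blocks several. The sketch below checks this with Python's standard `urllib.robotparser`. The robots.txt text in it is a partial reconstruction from the tables above, not the raw file, and it includes only literal rules because `urllib.robotparser` does not interpret `*` wildcards. `ExampleBot` is a hypothetical user agent.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the scanned file (assumption: literal rules only).
ROBOTS_LINES = """\
User-agent: *
Disallow: /php/
Disallow: /clubs/new

User-agent: mediapartners-google
Disallow:
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

url = "https://holdsport.dk/php/index.php"

# A generic crawler falls back to the "*" group and is blocked from /php/.
generic_allowed = parser.can_fetch("ExampleBot", url)

# The AdSense crawler has its own group with an empty Disallow, so it is allowed.
adsense_allowed = parser.can_fetch("mediapartners-google", url)

if __name__ == "__main__":
    print("ExampleBot:", generic_allowed)
    print("mediapartners-google:", adsense_allowed)
```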