steenwijkercourant.nl
robots.txt

Robots Exclusion Standard data for steenwijkercourant.nl

Resource Scan

Scan Details

Site Domain steenwijkercourant.nl
Base Domain steenwijkercourant.nl
Scan Status Ok
Last Scan2024-05-28T09:06:13+00:00
Next Scan 2024-06-04T09:06:13+00:00

Last Scan

Scanned2024-05-28T09:06:13+00:00
URL https://steenwijkercourant.nl/robots.txt
Domain IPs 104.18.38.45, 172.64.149.211, 2606:4700:4400::6812:262d, 2606:4700:4400::ac40:95d3
Response IP 172.64.149.211
Found Yes
Hash 2a8a9575a55143d825e97fb7e96d4051efefa35f9338ca0d9efa117d3b4f8134
SimHash 6a15d872c913

Groups

*

Rule Path
Disallow /most-read
Disallow /tag
Disallow /search
Allow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://static.steenwijkercourant.nl/sitemap/sitemap.xml.gz
sitemap https://static.steenwijkercourant.nl/sitemap/sitemap_news.xml.gz
sitemap https://static.steenwijkercourant.nl/sitemap/sitemap_sections.xml.gz