ggd.amsterdam.nl
robots.txt

Robots Exclusion Standard data for ggd.amsterdam.nl

Resource Scan

Scan Details

Site Domain ggd.amsterdam.nl
Base Domain amsterdam.nl
Scan Status Ok
Last Scan 2024-05-23T06:56:42+00:00
Next Scan 2024-06-22T06:56:42+00:00

Last Scan

Scanned 2024-05-23T06:56:42+00:00
URL https://ggd.amsterdam.nl/robots.txt
Domain IPs 2a07:3500:1020:f524::186, 46.17.24.186
Response IP 46.17.24.186
Found Yes
Hash 0aae66a54982a5d0541eb48886f4156af0988dc4a2702a43a190d090eea24326
SimHash 27124a246a92

Groups

simplepie

Rule Path
Disallow /

curl

Rule Path
Disallow /

python urllib

Rule Path
Disallow /

osce

Rule Path
Disallow /

wget

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

abonti

Rule Path
Disallow /

linkchecker

Rule Path
Disallow /

jetslide

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

eknip

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

kingspider

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /
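Each named group above blocks one crawler from the whole site. As a rough illustration of how a client might evaluate such groups, the sketch below feeds a small fragment to Python's standard urllib.robotparser. The fragment is reconstructed from two of the groups listed in this report (curl and wget), not copied from the raw file, and the agent name "examplebot" is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): two of the per-agent groups from the
# report, written in robots.txt syntax. The real file is not reproduced here.
FRAGMENT = """\
User-agent: curl
Disallow: /

User-agent: wget
Disallow: /
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

URL = "https://ggd.amsterdam.nl/"

# Agents named in a group with "Disallow: /" may not fetch anything.
curl_allowed = parser.can_fetch("curl/8.5.0", URL)
wget_allowed = parser.can_fetch("Wget/1.21.4", URL)

# "examplebot" is a hypothetical agent with no group in this fragment,
# so the parser falls back to allowing the request.
other_allowed = parser.can_fetch("examplebot", URL)

print(curl_allowed, wget_allowed, other_allowed)
```

In the full file an unlisted agent would fall through to the "*" group rather than being allowed unconditionally.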

*

Rule Path
Disallow /aspx/
Disallow /*?*pdf=true*
Disallow /*?*rtf=true*
Disallow /*?*CalDtm=*
Disallow /*?*zoeken_term=*
Disallow /*?*Zoe=*
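The rules in the "*" group rely on the "*" wildcard, which matches any sequence of characters, with a trailing "$" anchoring the end of the path (as described in RFC 9309). Below is a minimal sketch of how such patterns could be tested against a path plus query string. The helper names pattern_to_regex and is_disallowed are made up for illustration, the sample paths are hypothetical, and Allow rules and longest-match precedence are deliberately left out.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/aspx/",
    "/*?*pdf=true*",
    "/*?*rtf=true*",
    "/*?*CalDtm=*",
    "/*?*zoeken_term=*",
    "/*?*Zoe=*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which mirrors prefix matching.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


# Hypothetical paths, for illustration only.
for sample in ["/aspx/page", "/nieuws?pdf=true", "/zoeken?x=1&Zoe=abc", "/nieuws"]:
    print(sample, is_disallowed(sample))
```

Matching is case-sensitive here, so a query parameter spelled "zoe=" would not be caught by the "Zoe=" rule.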

Other Records

Field Value
crawl-delay 3
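Crawl-delay is not part of RFC 9309, but some crawlers treat it as the minimum number of seconds to wait between requests. The report does not say which group the record belongs to, so the sketch below simply assumes a client wants to honor a 3-second gap. PoliteScheduler is a hypothetical helper; the clock and sleep functions are injectable so the logic can be exercised without real waiting.

```python
import time

CRAWL_DELAY_SECONDS = 3  # value from the crawl-delay record above


class PoliteScheduler:
    """Enforce a minimum gap between successive requests."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Sleep until `delay` seconds have passed since the last call.

        Returns the number of seconds actually slept.
        """
        now = self._clock()
        pause = 0.0
        if self._last is not None:
            pause = max(0.0, self.delay - (now - self._last))
            if pause:
                self._sleep(pause)
        self._last = now + pause
        return pause
```

A crawler would call wait() immediately before each request, for example scheduler = PoliteScheduler(CRAWL_DELAY_SECONDS) followed by scheduler.wait() inside the fetch loop.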

Other Records

Field Value
sitemap https://www.ggd.amsterdam.nl/sitemap.xml
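The sitemap record can be read programmatically as well. The sketch below assumes Python 3.8 or later, where urllib.robotparser exposes site_maps(), and parses a one-line fragment reconstructed from the record above rather than the live file. Note that the sitemap URL uses the www.ggd.amsterdam.nl host, while the scanned site domain is ggd.amsterdam.nl.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): only the sitemap record from the report.
FRAGMENT = "Sitemap: https://www.ggd.amsterdam.nl/sitemap.xml\n"

SITE_DOMAIN = "ggd.amsterdam.nl"  # "Site Domain" from the scan details

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps() or []

# Compare each sitemap host with the scanned site domain.
hosts = [urlsplit(url).hostname for url in sitemaps]
host_matches = [host == SITE_DOMAIN for host in hosts]

print(sitemaps, hosts, host_matches)
```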

Warnings

  • 2 invalid lines.