agileacademy.nl
robots.txt

Robots Exclusion Standard data for agileacademy.nl

Resource Scan

Scan Details

Site Domain	agileacademy.nl
Base Domain	agileacademy.nl
Scan Status	Ok
Last Scan	2024-09-27T11:47:15+00:00
Next Scan	2024-10-27T11:47:15+00:00

Last Scan

Scanned	2024-09-27T11:47:15+00:00
URL	https://agileacademy.nl/robots.txt
Domain IPs	62.221.192.187
Response IP	62.221.192.187
Found	Yes
Hash	003ae4034a7fe81fe04a1b956584d63a7dc7b784c284bb9ec83163a2bace3572
SimHash	5800dd194ffc

Groups

*

Rule	Path
Disallow	/

Other Records

Field	Value
crawl-delay	29
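
As a rough illustration of what the `*` group means for a crawler that matches none of the named groups, the sketch below feeds a reconstruction of that group to Python's standard `urllib.robotparser`. The line order inside the reconstructed file and the crawler name `ExampleBot` are assumptions; the scan only records the rule, the path and the crawl-delay value.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the "*" group from the scan (line order is assumed).
ROBOTS_TXT = """\
User-agent: *
Disallow: /
Crawl-delay: 29
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a hypothetical crawler name that matches no named group,
# so it falls back to the "*" group.
blocked = not parser.can_fetch("ExampleBot", "https://agileacademy.nl/")
delay = parser.crawl_delay("ExampleBot")
print(blocked, delay)
```

With `Disallow: /` every path is off-limits to such a crawler, and the crawl-delay value is a request that compliant crawlers wait that many seconds between fetches; not every crawler honours it.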

googlebot
googlebot-image
mediapartners-google
msnbot
msnbot-media
slurp
yahoo-blogs
yahoo-mmcrawler
rogerbot

Rule	Path
Disallow	/includes/
Disallow	/mail/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/sites/
Disallow	/themes/
Disallow	/pcs/
Disallow	/img/
Disallow	/fix/
Disallow	/tnc/
Disallow	/signature/
Disallow	/signaturein/
Disallow	/signature_info/
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/xmlrpc.php
Disallow	/admin/
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php
Disallow	/comment/reply/
Disallow	/contact/
Disallow	/logout/
Disallow	/node/add/
Disallow	/search/
Disallow	/opensearch/
Disallow	/user/register/
Disallow	/user/password/
Disallow	/user/login/
Disallow	/?q=admin%2F
Disallow	/?q=comment%2Freply%2F
Disallow	/?q=contact%2F
Disallow	/?q=logout%2F
Disallow	/?q=node%2Fadd%2F
Disallow	/?q=search%2F
Disallow	/?q=user%2Fpassword%2F
Disallow	/?q=user%2Fregister%2F
Disallow	/?q=user%2Flogin%2F
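
The group above pairs `Disallow /wp-admin/` with `Allow /wp-admin/admin-ajax.php`. The scan does not say how a given crawler resolves such overlaps; under RFC 9309 the longest matching pattern wins and a tie goes to the allow rule. The sketch below implements only that precedence with plain prefix matching (no wildcards), using a small subset of the listed rules. The function name `is_allowed` and the test path `/courses/` are hypothetical.

```python
# Subset of the rules listed for the named crawlers (prefix patterns only).
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/user/login/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not path.startswith(pattern):
            continue
        is_allow = kind == "allow"
        # A longer pattern wins; on equal length the allow rule wins.
        if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
            best_len = len(pattern)
            allowed = is_allow
    return allowed


print(is_allowed("/wp-admin/admin-ajax.php"))
print(is_allowed("/wp-admin/options.php"))
```

Parsers that apply the first matching rule instead of the longest one would treat `/wp-admin/admin-ajax.php` as blocked if the `Disallow` line comes first in the file, so the result depends on the crawler.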

Other Records

Field	Value
crawl-delay	29

Comments

robots.txt
This file prevents crawlers and indexers from accessing certain parts of your website.
It tells robots what is off-limits.
This saves a lot of bandwidth and server resources.
This file only works if it is located in the root of your site:
Used: https://example.com/robots.txt
Ignored: https://example.com/site/robots.txt
For more information about the robots.txt standard, see:
https://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
disallow all
but allow only important bots
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
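
The comments above note that only a robots.txt at the site root is used. A minimal sketch of how a crawler might derive that location from any page URL, using Python's standard `urllib.parse`; the helper name `robots_txt_url` and the page URL are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url):
    """Return the root-level robots.txt URL for the site serving `page_url`."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace path, query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_txt_url("https://example.com/site/page.html"))
```

A file at `https://example.com/site/robots.txt` is never derived this way, which is why it is ignored.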
