accessart.org.uk
robots.txt

Robots Exclusion Standard data for accessart.org.uk

Resource Scan

Scan Details

Site Domain	accessart.org.uk
Base Domain	accessart.org.uk
Scan Status	Ok
Last Scan	2024-09-27T09:02:56+00:00
Next Scan	2024-10-27T09:02:56+00:00

Last Scan

Scanned	2024-09-27T09:02:56+00:00
URL	https://accessart.org.uk/robots.txt
Redirect	https://www.accessart.org.uk/robots.txt
Redirect Domain	www.accessart.org.uk
Redirect Base	accessart.org.uk
Domain IPs	192.124.249.59
Redirect IPs	192.124.249.59
Response IP	192.124.249.59
Found	Yes
Hash	ed6a39c3a56e83a47df523192dec3763d012d5b1d94ffa4f114816b26c666d44
SimHash	cce6c94174a9

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/cgi-bin/
Disallow	/trackback/
Disallow	/xmlrpc.php
Disallow	/wp-login.php
Disallow	/wp-signup.php
Disallow	/?s=*
Disallow	/search/
Disallow	/search/*
Disallow	/wp-json/
Disallow	/*/trackback/
Disallow	/*/comments/
Disallow	/*?add-to-cart=*
Disallow	/*?orderby=*
Disallow	/*?filter_*
Disallow	/cdn-cgi/bm/cv/
Disallow	/cdn-cgi/challenge-platform/
Allow	/wp-content/uploads/
Allow	/wp-content/cache/
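
Several paths in this group use the `*` wildcard, which plain prefix matching does not handle. The following is a minimal sketch of how such patterns might be evaluated, assuming the matching rules described in RFC 9309 (the longest matching pattern wins, Allow wins ties, and no match means allowed). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only a subset of the group's rules is included.

```python
import re

# (directive, path) pairs copied from a subset of the "*" group in this report.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/?s=*"),
    ("disallow", "/*/comments/"),
    ("disallow", "/*?add-to-cart=*"),
    ("disallow", "/wp-content/plugins/"),
    ("allow", "/wp-content/uploads/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under these assumptions, `is_allowed("/shop/?add-to-cart=12")` is false because the wildcard pattern matches anywhere after the leading slash, while `is_allowed("/about/")` is true because no rule matches it.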

ahrefsbot
semrushbot
mj12bot
dotbot
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule	Path
Disallow	/

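A group that lists many user agents above a single `Disallow: /` blocks each of them from the whole site. As a rough illustration, Python's standard `urllib.robotparser` can evaluate this kind of simple prefix rule. The `FRAGMENT` text below is an assumed reconstruction of how the file might lay out two of these agents, not a copy of the real file, and the crawler name `SomeOtherBot` is made up.

```python
from urllib.robotparser import RobotFileParser

# Assumed layout: two of the banned agents sharing one group, plus one global rule.
FRAGMENT = """\
User-agent: ahrefsbot
User-agent: semrushbot
Disallow: /

User-agent: *
Disallow: /wp-admin/
"""

rp = RobotFileParser()
rp.parse(FRAGMENT.splitlines())

# A banned crawler is refused everywhere on the site.
banned_home = rp.can_fetch("AhrefsBot/7.0", "https://www.accessart.org.uk/")

# A crawler with no group of its own falls back to the "*" rules.
other_home = rp.can_fetch("SomeOtherBot", "https://www.accessart.org.uk/")
other_admin = rp.can_fetch("SomeOtherBot", "https://www.accessart.org.uk/wp-admin/")
```

Note that `urllib.robotparser` matches paths by prefix only, so it is not a reliable check for the wildcard rules in this file.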

googlebot

Rule	Path
Disallow	/*/feed/

bingbot

Rule	Path
Disallow	/*/feed/

slurp

Rule	Path
Disallow	/*/feed/

*

Rule	Path
Allow	/*/feed/

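The file contains two `*` groups plus dedicated groups for googlebot, bingbot, and slurp. Under RFC 9309, as far as I understand it, a crawler obeys only the group that matches its product token and falls back to `*` when none matches, and groups with the same user agent are combined. A small sketch of that selection step follows. `GROUPS` and `select_group` are hypothetical names, the rule lists are abbreviated, and the two `*` groups are merged by hand.

```python
# User-agent token -> rules, abbreviated from this report.
# The two "*" groups are merged into one entry, as RFC 9309 requires.
GROUPS = {
    "*": ["Disallow: /wp-admin/", "Allow: /*/feed/"],
    "ahrefsbot": ["Disallow: /"],
    "googlebot": ["Disallow: /*/feed/"],
    "bingbot": ["Disallow: /*/feed/"],
    "slurp": ["Disallow: /*/feed/"],
}


def select_group(user_agent: str, groups=GROUPS):
    """Return the rules of the group matching the crawler's product token.

    Falls back to the "*" group when no named group matches.
    """
    product = user_agent.split("/")[0].strip().lower()
    for token, rules in groups.items():
        if token != "*" and token == product:
            return rules
    return groups.get("*", [])
```

One consequence worth checking against the site owner's intent: with this selection logic, `select_group("Googlebot/2.1")` yields only the feed rule, so the global disallow list in the `*` group would not apply to that crawler.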

Other Records

Field	Value
sitemap	https://www.accessart.org.uk/sitemap.xml

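The sitemap record sits outside any user-agent group and applies to all crawlers. For readers who want to pull it out programmatically, here is a short sketch using `site_maps()` from Python's standard `urllib.robotparser` (available in Python 3.8 and later). The single-line input is an assumed reconstruction of the record, not the full file.

```python
from urllib.robotparser import RobotFileParser

parser = RobotFileParser()
# Assumed form of the sitemap record as it would appear in robots.txt.
parser.parse(["Sitemap: https://www.accessart.org.uk/sitemap.xml"])

# site_maps() returns a list of sitemap URLs, or None when the file has none.
sitemaps = parser.site_maps()
```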

Comments

Global rules
-----------------
Prevent crawling CF challenge URLs
Allow access to necessary assets
Sitemap
-----------------
Ban bots that don't benefit us.
--------------------------------
Block feeds for search engines to reduce server load
-------------------------------
Allow feeds for other user agents
-----------------------------------
