topmarks.co.uk
robots.txt

Robots Exclusion Standard data for topmarks.co.uk

Resource Scan

Scan Details

Site Domain topmarks.co.uk
Base Domain topmarks.co.uk
Scan Status Ok
Last Scan2024-11-16T11:12:16+00:00
Next Scan 2024-11-23T11:12:16+00:00

Last Scan

Scanned2024-11-16T11:12:16+00:00
URL https://topmarks.co.uk/robots.txt
Redirect https://www.topmarks.co.uk/robots.txt
Redirect Domain www.topmarks.co.uk
Redirect Base topmarks.co.uk
Domain IPs 104.26.0.184, 104.26.1.184, 172.67.73.207, 2606:4700:20::681a:1b8, 2606:4700:20::681a:b8, 2606:4700:20::ac43:49cf
Redirect IPs 104.26.0.184, 104.26.1.184, 172.67.73.207, 2606:4700:20::681a:1b8, 2606:4700:20::681a:b8, 2606:4700:20::ac43:49cf
Response IP 104.26.0.184
Found Yes
Hash 37273c6baffc9d5dcdc65e9f0630ba0090f3f6f8147be84a8ec61d22b07b3130
SimHash 89507a545755

Groups

mediapartners-google

Rule Path
Disallow

twitterbot

Rule Path
Disallow

exabot

Rule Path
Disallow *.gif$
Disallow *.jpg$
Disallow *.png$

googlebot-image

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow /admin/
Disallow /ads/
Disallow /bin/
Disallow /errors/
Disallow /images/
Disallow /media/
Disallow /parents/images/
Disallow /TermsOfUse
Disallow /InteractiveLiteracy.aspx
Disallow /InteractiveScience.aspx
Disallow /InteractiveSubjects.aspx
Disallow /PlayPop.aspx
Disallow /r.aspx
Allow /media/social/