mainline.co.uk
robots.txt

Robots Exclusion Standard data for mainline.co.uk

Resource Scan

Scan Details

Site Domain mainline.co.uk
Base Domain mainline.co.uk
Scan Status Ok
Last Scan2025-06-24T05:29:29+00:00
Next Scan 2025-07-24T05:29:29+00:00

Last Scan

Scanned2025-06-24T05:29:29+00:00
URL http://mainline.co.uk/robots.txt
Redirect http://www.mainline.co.uk/robots.txt
Redirect Domain www.mainline.co.uk
Redirect Base mainline.co.uk
Domain IPs 208.67.249.236
Redirect IPs 208.67.249.236
Response IP 208.67.249.236
Found Yes
Hash 6dab9c87a5c57a2ca79079dbcd0e2bbe15e98a0eabce3ac59944121e1d686ea0
SimHash 583dd250e5ab

Groups

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

applebot

Rule Path
Allow /

yahoo

Rule Path
Allow /

yandex

Rule Path
Allow /

baiduspider

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /tmp/
Disallow /private/
Disallow /config/

Comments

  • Allow well-known search engines and reputable crawlers
  • Block known unwanted bots and scrapers
  • General restrictions for all other bots