patternbank.com
robots.txt

Robots Exclusion Standard data for patternbank.com

Resource Scan

Scan Details

Site Domain patternbank.com
Base Domain patternbank.com
Scan Status Ok
Last Scan2025-10-02T19:07:47+00:00
Next Scan 2025-11-01T19:07:47+00:00

Last Scan

Scanned2025-10-02T19:07:47+00:00
URL https://patternbank.com/robots.txt
Domain IPs 104.26.6.81, 104.26.7.81, 172.67.73.147, 2606:4700:20::681a:651, 2606:4700:20::681a:751, 2606:4700:20::ac43:4993
Response IP 172.67.73.147
Found Yes
Hash 79a99db0ef6b77214e7aa91cedf3ca68a88b40af690be49d916ef1df3d98ec89
SimHash 2a1db9a1eaf8

Groups

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /*?licence=
Disallow /*%26licence%3D
Disallow /*modals?dialog=true
Disallow /*?modal=
Disallow /*%26modal%3D
Disallow /*?tab=
Disallow /*tab%3D
Disallow /*?image_format=
Disallow /*%26image_format%3D
Disallow /*?sort=
Disallow /*%26sort%3D
Disallow /*?page=
Disallow /*%26page%3D
Disallow /colours/*

Other Records

Field Value
crawl-delay 25

Other Records

Field Value
sitemap https://patternbank.com/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /

Warnings

  • 2 invalid lines.