gallagherseals.com
robots.txt

Robots Exclusion Standard data for gallagherseals.com

Resource Scan

Scan Details

Site Domain gallagherseals.com
Base Domain gallagherseals.com
Scan Status Ok
Last Scan 2024-10-24T23:04:15+00:00
Next Scan 2024-11-23T23:04:15+00:00

Last Scan

Scanned 2024-10-24T23:04:15+00:00
URL https://gallagherseals.com/robots.txt
Redirect https://www.gallagherseals.com/robots.txt
Redirect Domain www.gallagherseals.com
Redirect Base gallagherseals.com
Domain IPs 151.101.2.132
Redirect IPs 151.101.2.132
Response IP 151.101.2.132
Found Yes
Hash c3bc6147440753ffaa95c3cae92afa80aa8f2da6a30974d84f8da8bd23b1829b
SimHash 432dfb5147f3

Groups

*

Rule Path
Disallow /app/
Disallow /errors/
Disallow /lib/
Disallow /scripts/
Disallow /var/
Disallow /index.php/
Disallow /catalogsearch/
Disallow /checkout/
Disallow /customer/
Disallow /newsletter/
Disallow /review/
Disallow /sendfriend/
Disallow /*SID%3D
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /*.php$
Disallow /*?*
Allow /*?p=*
Allow /blog*?page=*
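
The wildcard rules above interact: `Disallow /*?*` blocks every URL with a query string, while the two `Allow` lines carve out exceptions. The following is a minimal sketch of how a crawler might evaluate them, assuming the matching semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie). The function names `pattern_to_regex` and `is_allowed` are hypothetical, percent-encoding normalization is left out, and individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group in this report, as (directive, pattern) pairs.
RULES = [
    ("disallow", "/app/"),
    ("disallow", "/errors/"),
    ("disallow", "/lib/"),
    ("disallow", "/scripts/"),
    ("disallow", "/var/"),
    ("disallow", "/index.php/"),
    ("disallow", "/catalogsearch/"),
    ("disallow", "/checkout/"),
    ("disallow", "/customer/"),
    ("disallow", "/newsletter/"),
    ("disallow", "/review/"),
    ("disallow", "/sendfriend/"),
    ("disallow", "/*SID%3D"),
    ("disallow", "/LICENSE.txt"),
    ("disallow", "/LICENSE_AFL.txt"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?*"),
    ("allow", "/*?p=*"),
    ("allow", "/blog*?page=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex (hypothetical helper)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" wherever a "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the longest-match rule."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Longer patterns are more specific; Allow wins when lengths are equal.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


# The example paths below are illustrative and are not taken from the scanned site.
print(is_allowed("/checkout/cart/"))     # False: matches Disallow /checkout/
print(is_allowed("/catalog?color=red"))  # False: matches Disallow /*?*
print(is_allowed("/catalog?p=2"))        # True: Allow /*?p=* is longer than /*?*
print(is_allowed("/blog/news?page=3"))   # True: Allow /blog*?page=* is the longest match
```

Under these assumptions, pagination URLs stay crawlable while other parameterized URLs are blocked.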

Other Records

Field Value
crawl-delay 10

seekportbot

Rule Path
Disallow /

rogerbot
dotbot

Rule Path
Allow /*?*

Other Records

Field Value
sitemap https://www.gallagherseals.com/pub/sitemap.xml
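
The non-rule records in this report (`crawl-delay 10` and the sitemap URL) can be read with Python's standard `urllib.robotparser`. The sketch below parses a hypothetical fragment rebuilt from the records listed here, not the original file, whose exact text is not shown in this report. `examplebot` is a made-up user agent. Note that `urllib.robotparser` does not implement `*` or `$` wildcards, so it is used here only for the delay, the sitemap, and a plain prefix rule.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction based on the records in this report;
# the real robots.txt may be laid out differently.
ROBOTS_FRAGMENT = """\
User-agent: *
Crawl-delay: 10
Disallow: /checkout/

User-agent: seekportbot
Disallow: /

Sitemap: https://www.gallagherseals.com/pub/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# "examplebot" has no group of its own, so it falls back to the "*" group.
delay = parser.crawl_delay("examplebot")
sitemaps = parser.site_maps()  # available in Python 3.8 and later
blocked = not parser.can_fetch("seekportbot", "https://www.gallagherseals.com/")

print(delay)     # 10
print(sitemaps)  # ['https://www.gallagherseals.com/pub/sitemap.xml']
print(blocked)   # True
```

`Crawl-delay` is not part of RFC 9309, so whether a given crawler waits 10 seconds between requests depends on that crawler.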

Comments

  • Directories
  • Paths (clean URLs)
  • Files
  • Paths (no clean URLs)

Warnings

  • 40 invalid lines.