freepatentsonline.com
robots.txt

Robots Exclusion Standard data for freepatentsonline.com

Resource Scan

Scan Details

Site Domain freepatentsonline.com
Base Domain freepatentsonline.com
Scan Status Ok
Last Scan 2024-09-21T02:10:55+00:00
Next Scan 2024-09-28T02:10:55+00:00

Last Scan

Scanned 2024-09-21T02:10:55+00:00
URL https://freepatentsonline.com/robots.txt
Redirect https://www.freepatentsonline.com/robots.txt
Redirect Domain www.freepatentsonline.com
Redirect Base freepatentsonline.com
Domain IPs 144.202.252.20
Redirect IPs 144.202.252.20
Response IP 144.202.252.20
Found Yes
Hash 9b4d3e8f9726f1777904cb5d6d6d8ebc3927e0994894542154ca2d314659c1d0
SimHash ac10c3570015

Groups

googlebot
mediapartners-google

Rule Path
Allow /
Disallow /*.pdf$
Disallow /y2003/0155736.html
Disallow /y2004/0043059.html
Disallow /createplaque.html?*
Disallow /surechem/index.php?*
Disallow /chemical/index.php?*
Disallow /*-display.jpg
Disallow /export/email_popup.php?*
Allow /surechem/
Allow /chemical/
Disallow /surechem/*.*
Disallow /chemical/*.*
Disallow /*?
Disallow /CCL1988
Disallow /CCL1990
Disallow /CCL1995
Disallow /CCL2001
Disallow /CCL2002
Disallow /CCL2003
Disallow /CCL2004
Disallow /CCL2005
Disallow /CCLA
Disallow /CCLB
Disallow /CCLC
Disallow /CCLD
Disallow /CCLF
Disallow /CCLG
Disallow /CCLH
Disallow /XEF
Disallow /CCL4
Disallow /EP*A.html
Disallow /EP*A1.html
Disallow /EP*A2.html
Disallow /EP*B1.html
Disallow /EP*B2.html
Disallow /regkey
Disallow /netacgi
Disallow /WO*A*.html

*

Rule Path
Disallow /
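The two groups above can be read as follows: googlebot and mediapartners-google get a broad Allow with specific Disallow exceptions, while every other crawler is blocked entirely. A minimal sketch of how such rules are commonly evaluated is shown below, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), with `*` as a wildcard and a trailing `$` as an end anchor. The function names and the abbreviated rule list are illustrative only; they are not part of the scan data, and this is not necessarily how any particular crawler implements matching.

```python
import re

def pattern_to_regex(path_pattern):
    # '*' matches any run of characters; a trailing '$' anchors the end of the path.
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path):
    # rules: list of (kind, pattern) pairs, where kind is "allow" or "disallow".
    # Assumption: longest matching pattern wins; on equal length, allow wins.
    best_len = -1
    best_allow = True  # a path matched by no rule is treated as allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow

# Abbreviated subset of the googlebot / mediapartners-google group listed above.
GOOGLEBOT_RULES = [
    ("allow", "/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*?"),
    ("allow", "/surechem/"),
    ("disallow", "/surechem/*.*"),
    ("disallow", "/CCL1988"),
    ("disallow", "/EP*A1.html"),
]

# The catch-all '*' group listed above.
DEFAULT_RULES = [("disallow", "/")]

if __name__ == "__main__":
    # The sample paths are hypothetical, chosen only to exercise the patterns.
    for sample in ["/y2010/0001234.html", "/doc.pdf", "/surechem/", "/surechem/index.php"]:
        print(sample, is_allowed(GOOGLEBOT_RULES, sample))
```

Under these assumptions, `/surechem/` stays crawlable for googlebot because `Allow /surechem/` is the longest matching pattern, while `/surechem/index.php` is blocked because `Disallow /surechem/*.*` is longer and also matches.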