gale.com
robots.txt

Robots Exclusion Standard data for gale.com

Resource Scan

Scan Details

Site Domain gale.com
Base Domain gale.com
Scan Status Ok
Last Scan2024-10-30T00:42:30+00:00
Next Scan 2024-11-29T00:42:30+00:00

Last Scan

Scanned2024-10-30T00:42:30+00:00
URL https://gale.com/robots.txt
Redirect https://www.gale.com/robots.txt
Redirect Domain www.gale.com
Redirect Base gale.com
Domain IPs 104.18.22.22, 104.18.23.22, 2606:4700::6812:1616, 2606:4700::6812:1716
Redirect IPs 104.18.22.22, 104.18.23.22, 2606:4700::6812:1616, 2606:4700::6812:1716
Response IP 104.18.22.22
Found Yes
Hash 5b5dcfe8ccc36aea2cb7ce3d938464aaac941add162dab1a44d0b2444abddfc2
SimHash 60044a00e110

Groups

*

Rule Path
Disallow /es/
Disallow /pt/
Disallow /*N-5p*

Other Records

Field Value
sitemap https://www.gale.com/sitemap.xml
sitemap https://www.gale.com/intl/sitemap.xml
sitemap https://www.gale.com/jp/sitemap.xml
sitemap https://www.gale.com/cn/sitemap.xml
sitemap https://www.gale.com/thorndike/sitemap.xml