globalsources.com
robots.txt

Robots Exclusion Standard data for globalsources.com

Resource Scan

Scan Details

Site Domain globalsources.com
Base Domain globalsources.com
Scan Status Ok
Last Scan2024-06-07T18:28:57+00:00
Next Scan 2024-07-07T18:28:57+00:00

Last Scan

Scanned2024-06-07T18:28:57+00:00
URL https://globalsources.com/robots.txt
Redirect https://www.globalsources.com/robots.txt
Redirect Domain www.globalsources.com
Redirect Base globalsources.com
Domain IPs 107.154.197.39, 107.154.200.39
Redirect IPs 107.154.197.39
Response IP 107.154.197.39
Found Yes
Hash 72a039256b80f0e93ae577a880ae5e741a02ac16fadaddc19dc0533855a7029e
SimHash 61c212e3d7b8

Groups

*

Rule Path
Disallow *%7B*
Disallow *.do?*
Disallow *undefine*
Disallow *productDetail*
Disallow /sensors/
Disallow /si/
Disallow /gsol/
Disallow /_nuxt/
Disallow /fonts/
Disallow /preview/

adsbot-google
adsbot-google-mobile

Rule Path
Disallow /sensors/
Disallow /si/
Disallow /gsol/
Disallow /product/
Disallow /_nuxt/
Disallow /products/
Disallow /wholesale/
Disallow /factory/
Disallow /TMX/
Disallow /fonts/
Disallow /preview/
Allow /product/landingPage/

yahooseeker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

voyager

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

applebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Warnings

  • 2 invalid lines.