stemcell.com
robots.txt

Robots Exclusion Standard data for stemcell.com

Resource Scan

Scan Details

Site Domain stemcell.com
Base Domain stemcell.com
Scan Status Ok
Last Scan 2024-09-19T18:45:15+00:00
Next Scan 2024-10-19T18:45:15+00:00

Last Scan

Scanned 2024-09-19T18:45:15+00:00
URL https://stemcell.com/robots.txt
Redirect https://www.stemcell.com/robots.txt
Redirect Domain www.stemcell.com
Redirect Base stemcell.com
Domain IPs 199.232.37.124
Redirect IPs 151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP 199.232.45.124
Found Yes
Hash c35b16cea975a1f62f44b053f7c48e4df0d8e25186368b9cebea1a601bb0b678
SimHash ee31dc78e27b

Groups

*

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /*.php$
Disallow /customer/
Disallow /review/
Disallow /*SID%3D
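
Several of the paths above use the `*` wildcard and the `$` end anchor. The following is a minimal sketch of how a crawler might evaluate them, assuming the common convention that `*` matches any run of characters and a trailing `$` pins the end of the path. The helper names `robots_pattern_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules, longest-match precedence, and percent-encoding normalization.

```python
import re

# Disallow paths copied from the first "*" group in this report.
RULES = [
    "/index.php/",
    "/*?",
    "/checkout/",
    "/*.php$",
    "/customer/",
    "/review/",
    "/*SID%3D",
]


def robots_pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=RULES):
    """Return True if any pattern matches from the start of the path."""
    return any(robots_pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, `is_disallowed("/products?page=2")` is true because of `/*?`, while `/foo.php5` is not caught by `/*.php$` since the `$` requires the path to end in `.php`.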

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

rogerbot

Rule Path
Disallow /*modal%3Drequest-support

Other Records

Field Value
crawl-delay 10

seekportbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2
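
The crawl-delay records listed for baiduspider, ahrefsbot, rogerbot and seekportbot can be read programmatically. The sketch below uses Python's standard `urllib.robotparser`; the robots.txt text is reconstructed from the records in this report, so its exact layout is an assumption rather than a copy of the live file, and `delay_for` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report; the live file's
# exact wording and ordering are assumptions.
ROBOTS_LINES = """\
User-agent: baiduspider
Crawl-delay: 10

User-agent: ahrefsbot
Crawl-delay: 5

User-agent: rogerbot
Disallow: /*modal%3Drequest-support
Crawl-delay: 10

User-agent: seekportbot
Crawl-delay: 2
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def delay_for(agent):
    """Return the crawl delay in seconds for an agent, or None if unset."""
    return parser.crawl_delay(agent)
```

A polite crawler could call `delay_for("ahrefsbot")` and sleep for that many seconds between requests. Note that `urllib.robotparser` does not interpret `*` or `$` inside paths, so it is used here only for the crawl-delay field.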

*

Rule Path
Disallow /ajax/

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

optimizer

Rule Path
Disallow /

Comments

  • https://megaindex.com/crawler