huaweicentral.com
robots.txt

Robots Exclusion Standard data for huaweicentral.com

Resource Scan

Scan Details

Site Domain huaweicentral.com
Base Domain huaweicentral.com
Scan Status Ok
Last Scan2025-10-27T02:05:36+00:00
Next Scan 2025-11-03T02:05:36+00:00

Last Scan

Scanned2025-10-27T02:05:36+00:00
URL https://huaweicentral.com/robots.txt
Domain IPs 104.26.4.223, 104.26.5.223, 172.67.68.88, 2606:4700:20::681a:4df, 2606:4700:20::681a:5df, 2606:4700:20::ac43:4458
Response IP 172.67.68.88
Found Yes
Hash cd7bc3787306f6a375b94adeb95058243df477ff4ea14a5f00c4b5a53b8ce6b3
SimHash 634ed850a881

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-login.php
Disallow /cdn-cgi/

mediapartners-google

Rule Path
Disallow

yandex

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

seznambot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

dataforseobot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

dotbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow

gptbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.huaweicentral.com/sitemap_index.xml

Comments

  • Sitemap location