zszyhl.com
robots.txt

Robots Exclusion Standard data for zszyhl.com

Resource Scan

Scan Details

Site Domain zszyhl.com
Base Domain zszyhl.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-26T17:45:15+00:00
Next Scan 2024-06-02T17:45:15+00:00

Last Successful Scan

Scanned2024-05-18T17:38:59+00:00
URL https://zszyhl.com/robots.txt
Domain IPs 2a02:4780:84:907a:dbe2:6ecc:5be2:f340, 84.32.84.30
Response IP 77.37.66.240
Found Yes
Hash 6e2f7e9cc117914c08f0abe00e727b56d0939da2c961ab63d68d97d72d527ad2
SimHash 2de895cb4192

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*horseracing/racecards/2019*
Disallow /*horseracing/racecards/2020*
Disallow /*horseracing/racecards/2021*
Disallow /*horseracing/racecards/2022*
Disallow /*horseracing/results/2019*
Disallow /*horseracing/results/2020*
Disallow /*horseracing/results/2021*
Disallow /*horseracing/results/2022*
Disallow /search/
Disallow /simwidgets/
Disallow /*?s=*
Disallow *%26s%3D*
Disallow /?p=*
Disallow /app/
Disallow /sso/login/
Disallow /wp-login.php
Disallow /amp-tealium/
Disallow /archives/

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

seekr

Rule Path
Disallow /

anthropic-aibytespider

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

Other Records

Field Value
sitemap /sitemap.xml
sitemap /news-sitemap.xml
sitemap /nav-sitemap.xml
sitemap /author-sitemap.xml

Comments

  • Sitemap archive
  • News Sitemap
  • Nav Sitemap
  • Author Sitemap