markandhazyl.com
robots.txt

Robots Exclusion Standard data for markandhazyl.com

Resource Scan

Scan Details

Site Domain markandhazyl.com
Base Domain markandhazyl.com
Scan Status Ok
Last Scan2025-08-22T21:31:39+00:00
Next Scan 2025-08-29T21:31:39+00:00

Last Scan

Scanned2025-08-22T21:31:39+00:00
URL https://markandhazyl.com/robots.txt
Redirect http://www.marknhazyl.com/robots.txt
Redirect Domain www.marknhazyl.com
Redirect Base marknhazyl.com
Domain IPs 104.21.2.194, 172.67.129.150, 2606:4700:3031::6815:2c2, 2606:4700:3034::ac43:8196
Redirect IPs 104.21.44.133, 172.67.200.145, 2606:4700:3032::6815:2c85, 2606:4700:3034::ac43:c891
Response IP 104.21.44.133
Found Yes
Hash 66d9d4d265650d4da348901f0ed03bebef5a0874852cd0cb1d0b7853f3459acd
SimHash 38915d0123c4

Groups

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path
Disallow

Comments

  • NOTICE: The collection of content and other data on this
  • site through automated means, including any device, tool,
  • or process designed to data mine or scrape content, is
  • prohibited except (1) for the purpose of search engine indexing or
  • artificial intelligence retrieval augmented generation or (2) with express
  • written permission from this site’s operator.
  • To request permission to license our intellectual
  • property and/or other materials, please contact this
  • site’s operator directly.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • ****************************************************************************
  • robots.txt
  • : Robots, spiders, and search engines use this file to detmine which
  • content they should *not* crawl while indexing your website.
  • : This system is called "The Robots Exclusion Standard."
  • : It is strongly encouraged to use a robots.txt validator to check
  • for valid syntax before any robots read it!
  • Examples:
  • Instruct all robots to stay out of the admin area.
  • : User-agent: *
  • : Disallow: /admin/
  • Restrict Google and MSN from indexing your images.
  • : User-agent: Googlebot
  • : Disallow: /images/
  • : User-agent: MSNBot
  • : Disallow: /images/
  • ****************************************************************************