alexmaclean.com
robots.txt

Robots Exclusion Standard data for alexmaclean.com

Resource Scan

Scan Details

Site Domain alexmaclean.com
Base Domain alexmaclean.com
Scan Status Ok
Last Scan2026-01-16T11:48:59+00:00
Next Scan 2026-01-30T11:48:59+00:00

Last Scan

Scanned2026-01-16T11:48:59+00:00
URL https://alexmaclean.com/robots.txt
Redirect https://cakhiazxl.cc/robots.txt
Redirect Domain cakhiazxl.cc
Redirect Base cakhiazxl.cc
Domain IPs 104.21.44.223, 172.67.204.35, 2606:4700:3035::ac43:cc23, 2606:4700:3037::6815:2cdf
Redirect IPs 104.18.2.145, 104.18.3.145, 2606:4700::6812:291, 2606:4700::6812:391
Response IP 104.18.2.145
Found Yes
Hash e62ff2811747a3684667ed4d214498f9ffd45d7c2cf67539eb07abcde29bb4a0
SimHash 6ba9dc0043a2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*/trackback
Disallow /img/
Disallow /tag/
Disallow /feed
Disallow /*/feed
Disallow /?s=*
Disallow /?link=*
Disallow /attachment/
Disallow /author/
Disallow /page/*
Disallow /truc-tiep/page/*
Disallow /*?utm_source
Disallow /*%26utm_source

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://cakhiazxl.cc/sitemap.xml

Comments

  • Allow Facebook scraper