matthewherbert.com
robots.txt

Robots Exclusion Standard data for matthewherbert.com

Resource Scan

Scan Details

Site Domain matthewherbert.com
Base Domain matthewherbert.com
Scan Status Ok
Last Scan2025-12-01T23:00:35+00:00
Next Scan 2025-12-31T23:00:35+00:00

Last Scan

Scanned2025-12-01T23:00:35+00:00
URL https://matthewherbert.com/robots.txt
Domain IPs 104.21.26.169, 172.67.137.101, 2606:4700:3036::6815:1aa9, 2606:4700:3037::ac43:8965
Response IP 172.67.137.101
Found Yes
Hash 09cda9ae667d21feb9066a90138d65e8bb0f3ea6bc2910b6d3ddd7aacbd94e64
SimHash 3808d102ca32

Groups

*

Rule Path
Allow /

cutestat

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-coub

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://matthewherbert.com/sitemap.xml