thealbertan.com
robots.txt

Robots Exclusion Standard data for thealbertan.com

Resource Scan

Scan Details

Site Domain thealbertan.com
Base Domain thealbertan.com
Scan Status Ok
Last Scan2024-10-30T09:56:29+00:00
Next Scan 2024-11-06T09:56:29+00:00

Last Scan

Scanned2024-10-30T09:56:29+00:00
URL https://thealbertan.com/robots.txt
Redirect https://www.thealbertan.com/robots.txt
Redirect Domain www.thealbertan.com
Redirect Base thealbertan.com
Domain IPs 104.18.30.211, 104.18.31.211, 2606:4700::6812:1ed3, 2606:4700::6812:1fd3
Redirect IPs 104.18.30.211, 104.18.31.211, 2606:4700::6812:1ed3, 2606:4700::6812:1fd3
Response IP 104.18.30.211
Found Yes
Hash b240b47c0c5c74c64961628aa2475377ca50bc326c0843c39586f8ad48e50c0f
SimHash 4904cba0e510

Groups

*

Rule Path
Allow /

googlebot-news

Rule Path
Allow /rss/showcase

googlebot

Rule Path
Allow /rss/showcase

semrushbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /