bruinsinsider.com
robots.txt
Robots Exclusion Standard data for bruinsinsider.com
Resource Scan
Scan Details
Site Domain | bruinsinsider.com |
Base Domain | bruinsinsider.com |
Scan Status | Ok |
Last Scan | 2024-05-28T04:00:18+00:00 |
Next Scan | 2024-06-04T04:00:18+00:00 |
Last Scan
Scanned | 2024-05-28T04:00:18+00:00 |
URL | https://bruinsinsider.com/robots.txt |
Redirect | https://www.bruinsinsider.com/robots.txt |
Redirect Domain | www.bruinsinsider.com |
Redirect Base | bruinsinsider.com |
Domain IPs | 68.168.112.242 |
Redirect IPs | 68.168.112.242 |
Response IP | 68.168.112.242 |
Found | Yes |
Hash | 3a60e7f590660093135a42e975b46d7a43a162531489fff1623d2b402af354a5 |
SimHash | c6051c5106b2 |
Groups
*
Rule | Path |
---|---|
Disallow | /_ftp_2021 |
Disallow | /_pagelog.php |
Disallow | /_publog.php |
Disallow | /_update.php |
Disallow | /_update_stats.php |
Disallow | /adwordstracking.php |
Disallow | /clic.php |
Disallow | /log.txt |
Disallow | /log_pub.txt |
Other Records
Field | Value |
---|---|
sitemap | https://www.bruinsinsider.com/sitemap.xml |
sitemap | https://www.bruinsinsider.com/sitemap_news.xml |