novelhall.com
robots.txt
Robots Exclusion Standard data for novelhall.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | novelhall.com |
Base Domain | novelhall.com |
Scan Status | Ok |
Last Scan | 2024-09-21T15:33:17+00:00 |
Next Scan | 2024-09-28T15:33:17+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-21T15:33:17+00:00 |
URL | https://novelhall.com/robots.txt |
Redirect | https://www.novelhall.com/robots.txt |
Redirect Domain | www.novelhall.com |
Redirect Base | novelhall.com |
Domain IPs | 104.26.8.235, 104.26.9.235, 172.67.71.72, 2606:4700:20::681a:8eb, 2606:4700:20::681a:9eb, 2606:4700:20::ac43:4748 |
Redirect IPs | 104.26.8.235, 104.26.9.235, 172.67.71.72, 2606:4700:20::681a:8eb, 2606:4700:20::681a:9eb, 2606:4700:20::ac43:4748 |
Response IP | 104.26.9.235 |
Found | Yes |
Hash | 8d2c22f950e90e76f45831f48b6c319e9355e07332fbc8267be5f26c2cf87dcc |
SimHash | 81009901c7b0 |
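The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, though the scan does not state which algorithm it uses or exactly which bytes it hashes. Assuming SHA-256 over the raw response body, a locally fetched copy of the file could be compared against the reported value roughly as follows. The function names `sha256_hex` and `matches_reported` are illustrative and not part of the scan.

```python
import hashlib

# Hash value copied from the scan table above.
REPORTED_HASH = "8d2c22f950e90e76f45831f48b6c319e9355e07332fbc8267be5f26c2cf87dcc"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Check a robots.txt body against the reported hash.

    Assumption: the scanner hashes the raw response body with SHA-256.
    If it normalizes the text first or uses another algorithm, this
    comparison will simply return False.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch does not necessarily mean the file changed; it may only mean the assumption about the hashing scheme is wrong.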
Groups
*
Rule | Path |
---|---|
Disallow | /cache/ |
Disallow | /config/ |
Disallow | /diy/ |
Disallow | /search-keyword-*.html |
Disallow | /book-list-novel-*.html |
Disallow | *?s=* |
Disallow | *?c=* |
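Several of these rules use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way to test a path against the listed patterns, following the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and matching starts at the beginning of the path). The helper names and the sample paths in the usage note are hypothetical. Because the group contains no Allow rules, any matching pattern is treated as a disallow.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/cache/",
    "/config/",
    "/diy/",
    "/search-keyword-*.html",
    "/book-list-novel-*.html",
    "*?s=*",
    "*?c=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and turn each "*" into ".*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""), re.DOTALL)


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if the path (including any query string) matches a rule."""
    # re.match anchors at the start of the string, as prefix matching requires.
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, `is_disallowed("/search-keyword-dragon.html")` and `is_disallowed("/index.php?s=abc")` both return `True`, while `is_disallowed("/some-novel/chapter-1.html")` returns `False`.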
Other Records
Field | Value |
---|---|
sitemap | https://www.novelhall.com/sitemap.xml |