sic.pt
robots.txt
Robots Exclusion Standard data for sic.pt
Resource Scan
Scan Details
Site Domain | sic.pt |
Base Domain | sic.pt |
Scan Status | Ok |
Last Scan | 2024-11-12T01:48:18+00:00 |
Next Scan | 2024-11-19T01:48:18+00:00 |
Last Scan
Scanned | 2024-11-12T01:48:18+00:00 |
URL | https://sic.pt/robots.txt |
Domain IPs | 13.226.2.23, 13.226.2.27, 13.226.2.38, 13.226.2.59 |
Response IP | 13.226.2.27 |
Found | Yes |
Hash | 30e1647f8dd62920b3f7a6e459638d4a21d18fdbd21dcb8eedc7820ef44abcad |
SimHash | 6a1ccb552c93 |
Groups
*
Rule | Path |
---|---|
Disallow | /api |
Disallow | /sites |
Disallow | /sandbox |
Disallow | /sandbox2013 |
Disallow | /pesquisa |
Disallow | /hotfolder-video-exports |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://sic.pt/sitemap/news.xml |
sitemap | https://sic.pt/sitemap/index.xml |
sitemap | https://sic.pt/sitemap/videos.xml |