newsca.ca
robots.txt
Robots Exclusion Standard data for newsca.ca
Resource Scan
Scan Details
Site Domain | newsca.ca |
Base Domain | newsca.ca |
Scan Status | Ok |
Last Scan | 2025-04-05T07:42:22+00:00 |
Next Scan | 2025-05-05T07:42:22+00:00 |
Last Scan
Scanned | 2025-04-05T07:42:22+00:00 |
URL | https://newsca.ca/robots.txt |
Redirect | https://wingservices.ca/robots.txt |
Redirect Domain | wingservices.ca |
Redirect Base | wingservices.ca |
Domain IPs | 104.21.30.127, 172.67.172.236, 2606:4700:3033::6815:1e7f, 2606:4700:3035::ac43:acec |
Redirect IPs | 104.21.90.152, 172.67.202.61, 2606:4700:3030::ac43:ca3d, 2606:4700:3034::6815:5a98 |
Response IP | 104.21.90.152 |
Found | Yes |
Hash | 488d3d4ade1010adcd00c9ab9779c010b01b497f71f489b5ea4a5cb48f8ffbfb |
SimHash | 6b841851c193 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /readme.html |
Disallow | /license.txt |
Disallow | /?s=* |
Allow | /*.js$ |
Allow | /*.css$ |
Allow | /wp-admin/images/* |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
sitemap | https://wingservices.ca/sitemap_index.xml |