rusdate.ca
robots.txt
Robots Exclusion Standard data for rusdate.ca
Resource Scan
Scan Details
Site Domain | rusdate.ca |
Base Domain | rusdate.ca |
Scan Status | Ok |
Last Scan | 2024-09-24T23:25:04+00:00 |
Next Scan | 2024-10-01T23:25:04+00:00 |
Last Scan
Scanned | 2024-09-24T23:25:04+00:00 |
URL | https://rusdate.ca/robots.txt |
Domain IPs | 52.18.83.69, 99.81.198.34 |
Response IP | 99.81.198.34 |
Found | Yes |
Hash | e9a514dc29505e3f9d82f05f010dd06e8df046448a5bf15f9d7f26690b584707 |
SimHash | 484ed44181b2 |
Groups
*
Rule | Path |
---|---|
Disallow | /*?* |
Disallow | /?* |
Disallow | /polls/ |
Disallow | /articles/ |
Disallow | /support/ |
Disallow | *.php* |
Disallow | /?action= |
Disallow | /*/?genre |
Disallow | /?tid= |
Disallow | /r/ |
Allow | /css/ |
Allow | /js/ |
Allow | /wl/ |
Allow | /site-images/ |
Allow | /?action=landing* |
googlebot
Rule | Path |
---|---|
Disallow | /*?* |
Disallow | /?* |
Disallow | /polls/ |
Disallow | /articles/ |
Disallow | /support/ |
Disallow | *.php* |
Disallow | /?action= |
Disallow | /*/?genre |
Disallow | /?tid= |
Disallow | /r/ |
Allow | /css/ |
Allow | /js/ |
Allow | /wl/ |
Allow | /site-images/ |
Allow | /?action=landing* |
Other Records
Field | Value |
---|---|
sitemap | https://rusdate.ca/sitemap.xml.gz |
Warnings
- `clean-param` is not a known field.
- `host` is not a known field.