busuu.com
robots.txt
Robots Exclusion Standard data for busuu.com
Resource Scan
Scan Details
Site Domain | busuu.com |
Base Domain | busuu.com |
Scan Status | Ok |
Last Scan | 2024-06-29T18:40:58+00:00 |
Next Scan | 2024-07-06T18:40:58+00:00 |
Last Scan
Scanned | 2024-06-29T18:40:58+00:00 |
URL | https://busuu.com/robots.txt |
Redirect | https://www.busuu.com/robots.txt |
Redirect Domain | www.busuu.com |
Redirect Base | busuu.com |
Domain IPs | 108.128.32.174, 52.18.120.67, 52.213.247.219 |
Redirect IPs | 108.128.32.174, 52.18.120.67, 52.213.247.219 |
Response IP | 108.128.32.174 |
Found | Yes |
Hash | 1f4ff9f4f5e4324bdf4928ff37b4fcee3b78909729deab5a242ba2a41ec3c216 |
SimHash | 512551c04a97 |
Groups
*
Rule | Path |
---|---|
Disallow | /backup/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /grav/ |
Disallow | /logs/ |
Disallow | /system/ |
Disallow | /vendor/ |
Disallow | /user/ |
Allow | /user/pages/ |
Allow | /user/themes/ |
Disallow | /n/*/paypal/redirect/ |
Disallow | /node_modules/ |
Disallow | /node/ |
Disallow | /dashboard/ |
Disallow | /products/ |
Disallow | /v/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.busuu.com/sitemap.xml |
Comments