linux.do
robots.txt
Robots Exclusion Standard data for linux.do
Resource Scan
Scan Details
Site Domain | linux.do |
Base Domain | linux.do |
Scan Status | Ok |
Last Scan | 2024-05-12T00:49:22+00:00 |
Next Scan | 2024-06-11T00:49:22+00:00 |
Last Scan
Scanned | 2024-05-12T00:49:22+00:00 |
URL | https://linux.do/robots.txt |
Domain IPs | 104.26.12.174, 104.26.13.174, 172.67.74.154, 2606:4700:20::681a:cae, 2606:4700:20::681a:dae, 2606:4700:20::ac43:4a9a |
Response IP | 104.26.13.174 |
Found | Yes |
Hash | 0fd8ca4900b55d36f52ae401a92a6e5f7681ac2512492992743da301782a6d47 |
SimHash | 299d1dc577d0 |
Groups
*
Rule | Path |
---|---|
Disallow | /invites/ |
Disallow | /admin/ |
Disallow | /auth/ |
Disallow | /assets/browser-update*.js |
Disallow | /email/ |
Disallow | /session |
Disallow | /user-api-key |
Disallow | /*?api_key* |
Disallow | /*?*api_key* |
Disallow | /badges |
Disallow | /u/ |
Disallow | /my |
Disallow | /search |
Disallow | /tag/*/l |
Disallow | /g |
Disallow | /t/*/*.rss |
Disallow | /c/*.rss |
Other Records
Field | Value |
---|---|
sitemap | https://linux.do/sitemap.xml |
Comments