sumika.me
robots.txt
Robots Exclusion Standard data for sumika.me
Resource Scan
Scan Details
Site Domain | sumika.me |
Base Domain | sumika.me |
Scan Status | Ok |
Last Scan | 2024-11-01T18:25:20+00:00 |
Next Scan | 2024-11-08T18:25:20+00:00 |
Last Scan
Scanned | 2024-11-01T18:25:20+00:00 |
URL | https://sumika.me/robots.txt |
Domain IPs | 13.33.30.124, 13.33.30.57, 13.33.30.65, 13.33.30.72 |
Response IP | 13.33.30.57 |
Found | Yes |
Hash | 0b9a9cbd1e3b635e2a4c24b3efc4a313509bdcb0bbd9ff40c89920fa06d149b6 |
SimHash | 684591714ff4 |
Groups
*
Rule | Path |
---|---|
Disallow | /u/ |
Disallow | /p/ |
Disallow | /20739468/ |
Disallow | /pro_profiles/*/fb_link |
Disallow | /pro_profiles/*/hp_link |
Disallow | /search/pro/rm/*/pa/*/p/* |
Disallow | /search/pro/rm/*/pa/*/pp/*/p/* |
Disallow | /pros/area/*/p/* |
Disallow | /pros/role/*/p/* |
Disallow | /jobs/search/?page=* |
Disallow | /jobs/search?page=* |
Disallow | /seek_advices/search/* |
Disallow | /*.pdf$ |
Allow | /u/photo/*/clip_count |
Other Records
Field | Value |
---|---|
sitemap | https://sumika.me/sitemap.xml.gz |