capbase.com
robots.txt
Robots Exclusion Standard data for capbase.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | capbase.com |
Base Domain | capbase.com |
Scan Status | Ok |
Last Scan | 2024-10-16T15:28:15+00:00 |
Next Scan | 2024-11-15T15:28:15+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-16T15:28:15+00:00 |
URL | https://capbase.com/robots.txt |
Domain IPs | 108.156.133.104, 108.156.133.23, 108.156.133.60, 108.156.133.69 |
Response IP | 108.156.133.69 |
Found | Yes |
Hash | 8860a147216b17f2fa36b591ebeda7df23b95449802153a685279df71651803d |
SimHash | 4c01d970e193 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /interview$ |
Disallow | /interview/$ |
Disallow | /meeting$ |
Disallow | /meeting/$ |
Disallow | /TarsCloud$ |
Disallow | /rtthread$ |
Disallow | /linuxfoundation$ |
Disallow | /sofastack$ |
Disallow | /mindspore$ |
Disallow | /explore/application-tools$ |
Disallow | /openeuler$ |
Disallow | /explore/web-app-develop$ |
Disallow | /explore/server-app$ |
Disallow | /product-designer-at-capbase/$ |
Disallow | /senior-javascript-engineer-at-capbase/$ |
Disallow | /partner/refer |
Disallow | /app/$ |
Disallow | /app$ |
Disallow | /blog/author/bryce/$ |
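
Most of the paths above end in `$`, which anchors the pattern to the end of the URL path, so `/app$` blocks `/app` itself but not `/app/dashboard`. The one rule without `$`, `/partner/refer`, is a prefix match. The following is a minimal sketch of how a crawler might evaluate these rules, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of any scanner or library, and only a subset of the rules is copied in for brevity.

```python
import re

# A subset of the "*" group from the table above.
# (Hypothetical structure: a list of (rule, path-pattern) pairs.)
RULES = [
    ("allow", "/"),
    ("disallow", "/interview$"),
    ("disallow", "/interview/$"),
    ("disallow", "/partner/refer"),
    ("disallow", "/app/$"),
    ("disallow", "/app$"),
    ("disallow", "/blog/author/bryce/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: the longest matching pattern wins, Allow wins
    a tie, and a path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/app", "/app/dashboard", "/interview", "/partner/refer/abc"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/app` and `/interview` are blocked by their `$`-anchored rules, `/app/dashboard` falls through to `Allow: /`, and anything under `/partner/refer` is blocked by the prefix rule. Python's standard `urllib.robotparser` is not used here because it treats rule paths as plain prefixes and does not interpret `$`.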
Other Records
Field | Value |
---|---|
sitemap | https://capbase.com/sitemap.xml |