Robots Exclusion Standard data for rallit.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | rallit.com |
Base Domain | rallit.com |
Scan Status | Ok |
Last Scan | 2024-11-05T15:12:19+00:00 |
Next Scan | 2024-12-05T15:12:19+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-05T15:12:19+00:00 |
URL | https://rallit.com/robots.txt |
Redirect | https://www.rallit.com:443/robots.txt |
Redirect Domain | www.rallit.com |
Redirect Base | rallit.com |
Domain IPs | 18.155.68.19, 18.155.68.39, 18.155.68.41, 18.155.68.87 |
Redirect IPs | 18.155.68.19, 18.155.68.39, 18.155.68.41, 18.155.68.87 |
Response IP | 18.155.68.19 |
Found | Yes |
Hash | 8c771330adce73a0d8475144df10f5fbe944c0c30902d945f574752aeb1f05f0 |
SimHash | ea6086c46592 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /resumes |
Disallow | /sentry_sample_error |
Disallow | /applicants/* |
Disallow | /apply |
Disallow | /auth |
Disallow | /my |
Disallow | /resume$ |
Disallow | /resume-pdf |
Disallow | /webview/* |
Disallow | /companies/788 |
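The rules above mix plain prefixes with the `*` wildcard and the `$` end-of-path anchor, so it may help to see how a crawler could evaluate them. The following is a minimal sketch, assuming the common longest-match convention (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not taken from the scan or from rallit.com.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/resumes"),
    ("disallow", "/sentry_sample_error"),
    ("disallow", "/applicants/*"),
    ("disallow", "/apply"),
    ("disallow", "/auth"),
    ("disallow", "/my"),
    ("disallow", "/resume$"),
    ("disallow", "/resume-pdf"),
    ("disallow", "/webview/*"),
    ("disallow", "/companies/788"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/resumes", "/resume", "/resume-pdf/1", "/companies/789"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/resume$` blocks only the exact path `/resume`, so `/resumes` stays crawlable through the Allow rule, while `/my` and `/companies/788` act as prefixes and also cover longer paths that begin with them.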
Other Records
Field | Value |
---|---|
sitemap | https://www.rallit.com/sitemap.xml |
sitemap | https://rallit.com/server-sitemap.xml |
Warnings
- `host` is not a known field.
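A warning like the one above could be produced by checking each field name against a list of recognized fields. The sketch below assumes the recognized set is `user-agent`, `allow`, `disallow`, and `sitemap`; the scanner's real list is not shown in this report. `KNOWN_FIELDS`, `find_unknown_fields`, and the `SAMPLE` text are hypothetical, and the `Host` value in the sample is invented for illustration, not copied from the site's file.

```python
# Assumed set of recognized fields; the scanner's actual list is unknown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def find_unknown_fields(text):
    """Return the field names in a robots.txt body that are not recognized.

    Names are lowercased and reported once each, in order of appearance.
    """
    unknown = []
    for raw in text.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Split on the first colon only, so URL values stay intact.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown


# Hypothetical sample, not the real file contents.
SAMPLE = """\
User-agent: *
Allow: /resumes
Disallow: /auth
Host: https://www.rallit.com
Sitemap: https://www.rallit.com/sitemap.xml
"""

if __name__ == "__main__":
    for name in find_unknown_fields(SAMPLE):
        print(f"`{name}` is not a known field.")
```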