theonelegian.com
robots.txt
Robots Exclusion Standard data for theonelegian.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | theonelegian.com |
Base Domain | theonelegian.com |
Scan Status | Ok |
Last Scan | 2024-11-10T06:03:23+00:00 |
Next Scan | 2024-12-10T06:03:23+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-10T06:03:23+00:00 |
URL | https://www.theonelegian.com/robots.txt |
Domain IPs | 13.33.88.108, 13.33.88.15, 13.33.88.40, 13.33.88.81 |
Response IP | 13.33.88.81 |
Found | Yes |
Hash | b109ce3df85cf4c81b5f0068cbd3e520077084dfec8102be21434d45388a466f |
SimHash | c19d53734da4 |
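The Hash value is 64 hexadecimal digits, which is the length of a SHA-256 digest, but the report does not name the algorithm or say which bytes were hashed. The sketch below assumes SHA-256 over the raw response body. The helper names `sha256_hex` and `matches_reported_hash` are hypothetical, and fetching the file is left to the reader.

```python
import hashlib

# Value copied from the "Hash" row above.
REPORTED_HASH = "b109ce3df85cf4c81b5f0068cbd3e520077084dfec8102be21434d45388a466f"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a downloaded body against the reported hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch does not necessarily mean the file changed. The scanner may have normalized line endings or used a different digest, in which case this comparison would not apply.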
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /covid19/ |
Disallow | /m2/covid19/ |
Disallow | /a/ |
Disallow | /booking/ |
Disallow | /bookcore/ |
Disallow | /m/search |
Disallow | /meetingroom_engine/ |
Disallow | /*? |
Allow | /*utm_source%3Dgoogle |
Allow | /*.js$ |
Allow | /*.css$ |
Allow | /bookcore/static/webmobile/* |
Disallow | /ajax/modals/ |
Disallow | /ajax/modals/* |
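Several of these rules use the `*` wildcard and the `$` end anchor, and some Allow rules overlap with Disallow rules. The sketch below is one way to evaluate a path against the group, assuming the longest-pattern-wins precedence described in RFC 9309, with Allow winning ties. The function names are hypothetical, and the comparison is literal: it does no percent-encoding normalization, so a real crawler may decide differently.

```python
import re

# Rules copied from the "*" group above, in the listed order.
RULES = [
    ("disallow", "/covid19/"),
    ("disallow", "/m2/covid19/"),
    ("disallow", "/a/"),
    ("disallow", "/booking/"),
    ("disallow", "/bookcore/"),
    ("disallow", "/m/search"),
    ("disallow", "/meetingroom_engine/"),
    ("disallow", "/*?"),
    ("allow", "/*utm_source%3Dgoogle"),
    ("allow", "/*.js$"),
    ("allow", "/*.css$"),
    ("allow", "/bookcore/static/webmobile/*"),
    ("disallow", "/ajax/modals/"),
    ("disallow", "/ajax/modals/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""), re.DOTALL)


def is_allowed(path, rules=RULES):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/bookcore/static/webmobile/app.png")` is true because the longer Allow pattern outranks `Disallow: /bookcore/`, while `is_allowed("/static/app.js?v=2")` is false because the query string prevents `/*.js$` from matching and `/*?` still applies.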
Other Records
Field | Value |
---|---|
sitemap | https://www.theonelegian.com/sitemap.xml |
sitemap | https://www.theonelegian.com/m2/sitemap.xml |