talent.maworldgroup.com
robots.txt
Robots Exclusion Standard data for talent.maworldgroup.com
Resource Scan
Scan Details
Site Domain | talent.maworldgroup.com |
Base Domain | maworldgroup.com |
Scan Status | Ok |
Last Scan | 2024-10-24T10:52:43+00:00 |
Next Scan | 2024-11-23T10:52:43+00:00 |
Last Scan
Scanned | 2024-10-24T10:52:43+00:00 |
URL | https://talent.maworldgroup.com/robots.txt |
Domain IPs | 104.26.8.65, 104.26.9.65, 172.67.74.45, 2606:4700:20::681a:841, 2606:4700:20::681a:941, 2606:4700:20::ac43:4a2d |
Response IP | 172.67.74.45 |
Found | Yes |
Hash | 4f16ae19cf6bf5cea8afa0d10a02a6c4d1873ec7f4366b386be200a64bee3abf |
SimHash | f1de9140c312 |
Groups
*
Rule | Path |
---|---|
Disallow | /download_pdf/ |
Disallow | /preview_pdf/ |
Disallow | /print/ |
Disallow | /news/pdf/ |
Disallow | /blog/pdf/ |
Disallow | /news_items/preview/ |
Disallow | /lightbox/ |
Disallow | /custom_lightbox/ |
Disallow | /custom_portfolio/ |
Disallow | /syndication/ |
Disallow | /newsletter/ |
Disallow | /syndication/email_lightbox/ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |