talent.maworldgroup.com
robots.txt

Robots Exclusion Standard data for talent.maworldgroup.com

Resource Scan

Scan Details

Site Domain talent.maworldgroup.com
Base Domain maworldgroup.com
Scan Status Ok
Last Scan2024-10-24T10:52:43+00:00
Next Scan 2024-11-23T10:52:43+00:00

Last Scan

Scanned2024-10-24T10:52:43+00:00
URL https://talent.maworldgroup.com/robots.txt
Domain IPs 104.26.8.65, 104.26.9.65, 172.67.74.45, 2606:4700:20::681a:841, 2606:4700:20::681a:941, 2606:4700:20::ac43:4a2d
Response IP 172.67.74.45
Found Yes
Hash 4f16ae19cf6bf5cea8afa0d10a02a6c4d1873ec7f4366b386be200a64bee3abf
SimHash f1de9140c312

Groups

amazonbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

kinza

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mb2345browser

Rule Path
Disallow /

micromessenger

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

phxbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

dataforseo

Rule Path
Disallow /

omgili

Rule Path
Disallow /

opensiteexplorer

Rule Path
Disallow /

petalsearch

Rule Path
Disallow /

*

Rule Path
Disallow /download_pdf/
Disallow /preview_pdf/
Disallow /print/
Disallow /news/pdf/
Disallow /blog/pdf/
Disallow /news_items/preview/
Disallow /lightbox/
Disallow /custom_lightbox/
Disallow /custom_portfolio/
Disallow /syndication/
Disallow /newsletter/
Disallow /syndication/email_lightbox/

Other Records

Field Value
crawl-delay 5