manpower.pl
robots.txt

Robots Exclusion Standard data for manpower.pl

Resource Scan

Scan Details

Site Domain manpower.pl
Base Domain manpower.pl
Scan Status Ok
Last Scan2024-11-06T02:38:34+00:00
Next Scan 2024-11-20T02:38:34+00:00

Last Scan

Scanned2024-11-06T02:38:34+00:00
URL https://manpower.pl/robots.txt
Redirect https://www.manpower.pl/robots.txt
Redirect Domain www.manpower.pl
Redirect Base manpower.pl
Domain IPs 104.18.34.236, 172.64.153.20, 2606:4700:4400::6812:22ec, 2606:4700:4400::ac40:9914
Redirect IPs 104.18.34.236, 172.64.153.20, 2606:4700:4400::6812:22ec, 2606:4700:4400::ac40:9914
Response IP 172.64.153.20
Found Yes
Hash cb9f852fca17ed6ca8cd518f4a394b71ca470daa3ad4d9afa5e91199d65e74de
SimHash 2a50b992c001

Groups

*

Rule Path
Disallow /
Disallow /en/for-job-seekers/manpower-myplan-app

slurp

Rule Path
Allow /
Disallow /uk
Disallow /candidate

teoma

Rule Path
Allow /
Disallow /candidate

google

Rule Path
Allow /
Disallow /candidate

msnbot

Rule Path
Allow /
Disallow /candidate

exabot

Rule Path
Allow /
Disallow /candidate

bingbot

Rule Path
Allow /
Disallow /candidate

applebot

Rule Path
Allow /
Disallow /candidate

aolbuild

Rule Path
Allow /
Disallow /candidate

googlebot

Rule Path
Allow /
Disallow /candidate

deepcrawl

Rule Path
Allow /
Disallow /candidate

ahrefsbot

Rule Path
Allow /
Disallow /candidate

bingpreview

Rule Path
Allow /
Disallow /candidate

duckduckbot

Rule Path
Allow /
Disallow /candidate

ia_archiver

Rule Path
Allow /
Disallow /candidate

facebot/1.0

Rule Path
Allow /
Disallow /candidate

siteauditbot

Rule Path
Allow /
Disallow /candidate

adsbot-google

Rule Path
Allow /
Disallow /candidate

semrushbot-sa

Rule Path
Allow /
Disallow /candidate

semrushbot-ba

Rule Path
Allow /
Disallow /candidate

semrushbot-si

Rule Path
Allow /
Disallow /candidate

semrushbot-ct

Rule Path
Allow /
Disallow /candidate

semrushbot-bm

Rule Path
Allow /
Disallow /candidate

semrushbot-swa

Rule Path
Allow /
Disallow /candidate

splitsignalbot

Rule Path
Allow /
Disallow /candidate

archive.org_bot

Rule Path
Allow /
Disallow /candidate

mediapartners-google

Rule Path
Allow /
Disallow /candidate

screaming frog seo spider

Rule Path
Allow /
Disallow /candidate

Other Records

Field Value
sitemap https://www.manpower.pl/sitemap.xml