www-orig.cma-cgm.com
robots.txt

Robots Exclusion Standard data for www-orig.cma-cgm.com

Resource Scan

Scan Details

Site Domain www-orig.cma-cgm.com
Base Domain cma-cgm.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-05T07:36:11+00:00
Next Scan 2025-11-04T07:36:11+00:00

Last Successful Scan

Scanned2024-06-28T01:53:04+00:00
URL https://www-orig.cma-cgm.com/robots.txt
Domain IPs 193.109.119.8
Response IP 193.109.119.8
Found Yes
Hash 6dd35b5d63d39b9a055f00c2038ea3305ab652998a98e69f30f70721b1895567
SimHash d90948c22fb0

Groups

*

Rule Path
Disallow /api/*
Disallow /health-monitoring
Disallow /static/Communication/Attachments/CMACGM_MAGAZINE_60_FR_Print_Def5_light.pdf
Disallow /detail-news/2523/cma-cgm-devoile-sa-conception-d-un-parcours-client-digital-et-lance-cma-cgm-esolutions?cat=ebusiness
Disallow /static/Communication/Attachments/BROCHURE%20CMA%20CGM%20LOG_Web.pdf
Disallow /static/Communication/Attachments/Mag43fr.pdf
Disallow /static/Communication/Attachments/CMACGM_USA_Brochure_201706.pdf
Disallow /static/Communication/Attachments/CMACGM_Career_Brochure_EN_201706.pdf
Disallow /documents-vas