
Robots Exclusion Standard data for mazars.to

Resource Scan

Scan Details

Site Domain mazars.to
Base Domain mazars.to
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Request timed out.
Last Scan 2025-12-21T01:55:47+00:00
Next Scan 2026-03-21T01:55:47+00:00

Last Successful Scan

Scanned 2023-08-25T11:32:53+00:00
URL http://mazars.to/robots.txt
Domain IPs 20.216.153.181
Response IP 20.216.153.181
Found Yes
Hash 2b031b85f66b8a7710eebce83ef3932dcf30488c9b22e3171c13653081371c24
SimHash c1bc40b4b742

Groups

*

Rule Path
Disallow /mazarspage/
Disallow /mazarsforms/
Disallow /content/download/
Disallow /layout/set/
Disallow /content/browse/
Disallow /content/view/
Disallow /user/login
Disallow /mazarsuser/
Disallow /content/search
Disallow /mazarsfind/
Disallow /mazarsAjax/

Other Records

Field Value
crawl-delay 4
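
As a rough illustration of how the "*" group above would apply to a crawler with no group of its own, the sketch below rebuilds the group as robots.txt lines and queries it with Python's standard urllib.robotparser. The line order and layout are assumptions (this report does not preserve the original file text), and "examplebot" and the sample paths are hypothetical names used only for the demonstration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report. The original file's exact
# layout (ordering, blank lines, comments) is an assumption.
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 4",
    "Disallow: /mazarspage/",
    "Disallow: /mazarsforms/",
    "Disallow: /content/download/",
    "Disallow: /layout/set/",
    "Disallow: /content/browse/",
    "Disallow: /content/view/",
    "Disallow: /user/login",
    "Disallow: /mazarsuser/",
    "Disallow: /content/search",
    "Disallow: /mazarsfind/",
    "Disallow: /mazarsAjax/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(path, agent="examplebot"):
    """Return True if `agent` (a hypothetical crawler name) may fetch `path`."""
    return parser.can_fetch(agent, "http://mazars.to" + path)


print(allowed("/user/login"))            # path listed under Disallow
print(allowed("/about-us"))              # hypothetical path, not listed
print(parser.crawl_delay("examplebot"))  # falls back to the "*" group
```

The rules in this group are plain path prefixes, so prefix matching as done by urllib.robotparser is sufficient here; the wildcard rules in the googlebot group need a different matcher.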

googlebot

Rule Path
Disallow /robots.txt
Disallow /content/download/*
Disallow /layout/set/*
Disallow /mazarspage/*
Allow /mazarspage/video_api/
Allow /mazarspage/our_office*
Disallow /content/download*/
Disallow /layout/set*/
Disallow /content/browse*/
Disallow /content/browse/*
Disallow /content/view/*
Allow /content/view/sitemap/*
Disallow /Global-contents/*
Disallow /Media/*
Disallow /mazarsuser/*
Allow /mazarsuser/directorylist/
Disallow /content/search*
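
The googlebot group mixes Allow and Disallow rules with "*" wildcards, so the outcome depends on rule precedence. The sketch below assumes the longest-match rule described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); it is an illustrative matcher, not the code any particular crawler runs. The function names and the sample paths are hypothetical.

```python
import re

# Rules copied from the googlebot group in this report, in listed order.
GOOGLEBOT_RULES = [
    ("disallow", "/robots.txt"),
    ("disallow", "/content/download/*"),
    ("disallow", "/layout/set/*"),
    ("disallow", "/mazarspage/*"),
    ("allow", "/mazarspage/video_api/"),
    ("allow", "/mazarspage/our_office*"),
    ("disallow", "/content/download*/"),
    ("disallow", "/layout/set*/"),
    ("disallow", "/content/browse*/"),
    ("disallow", "/content/browse/*"),
    ("disallow", "/content/view/*"),
    ("allow", "/content/view/sitemap/*"),
    ("disallow", "/Global-contents/*"),
    ("disallow", "/Media/*"),
    ("disallow", "/mazarsuser/*"),
    ("allow", "/mazarsuser/directorylist/"),
    ("disallow", "/content/search*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=GOOGLEBOT_RULES):
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


for sample in ("/mazarspage/video_api/clip", "/mazarspage/contact",
               "/content/view/sitemap/index", "/about"):
    print(sample, is_allowed(sample))
```

Under this assumption, the Allow entries carve exceptions out of the broader Disallow patterns because they are longer, regardless of where they appear in the group.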

googlebot-news

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

adidxbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

facebot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

wget

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

httrack

Rule Path
Disallow /