cari.com.pt
robots.txt

Robots Exclusion Standard data for cari.com.pt

Resource Scan

Scan Details

Site Domain cari.com.pt
Base Domain cari.com.pt
Scan Status Ok
Last Scan2026-01-03T05:39:56+00:00
Next Scan 2026-01-10T05:39:56+00:00

Last Scan

Scanned2026-01-03T05:39:56+00:00
URL https://cari.com.pt/robots.txt
Redirect https://www.cari.com.pt/robots.txt
Redirect Domain www.cari.com.pt
Redirect Base cari.com.pt
Domain IPs 144.76.139.199
Redirect IPs 144.76.139.199
Response IP 144.76.139.199
Found Yes
Hash 84adb74b64aedbed202350e39fef190b2bbfca899e39e5fa3fc6e3a2a244e352
SimHash 749c085b80e4

Groups

*

Rule Path
Allow /
Disallow /index.php?*
Disallow /ajax/*
Disallow /motor/view/*
Disallow /mobil/view/*
Disallow /rumah/view/*
Disallow /lowongan-kerja/view/*
Disallow /shopping/view/*
Disallow /index.php?keyword=*
Disallow /index.php?make=*
Disallow /CAR_FOLDER/*
Disallow /MOTOR_FOLDER/*
Disallow /HOMES_FOLDER/*
Disallow /JOBS_FOLDER/*
Disallow /SHOPPING_FOLDER/*
Disallow /keyword/*
Disallow /lowogan-kerja/*
Disallow /cars/view/*
Disallow /motorcycles/view/*
Disallow /homes/view/*
Disallow /jobs/view/*
Disallow /autos/view/*
Disallow /motos/view/*
Disallow /casas/view/*
Disallow /empleo/view/*
Disallow /empleos/view/*

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /