infobel.pt
robots.txt

Robots Exclusion Standard data for infobel.pt

Resource Scan

Scan Details

Site Domain infobel.pt
Base Domain infobel.pt
Scan Status Ok
Last Scan 2024-05-10T13:50:58+00:00
Next Scan 2024-05-17T13:50:58+00:00

Last Scan

Scanned 2024-05-10T13:50:58+00:00
URL https://infobel.pt/robots.txt
Redirect https://local.infobel.pt/robots.txt
Redirect Domain local.infobel.pt
Redirect Base infobel.pt
Domain IPs 194.7.35.240
Redirect IPs 194.7.35.218
Response IP 194.7.35.218
Found Yes
Hash 6f208a3f710539d1a4550277fa3a400e86371831b84bfd4f2f4ff2356716fa98
SimHash 584c884922b3

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /
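
The twelve named groups above each carry a single `Disallow /` rule, which blocks the whole site for those crawlers. A minimal sketch of checking this with Python's standard `urllib.robotparser` follows. The `User-agent:` / `Disallow:` lines are rebuilt from the group names listed in this report, so their exact layout in the original file is an assumption, and `build_parser` and `examplebot` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Group names as listed in the scan report above.
AI_CRAWLER_GROUPS = [
    "ccbot", "chatgpt-user", "gptbot", "anthropic-ai", "claudebot",
    "omgilibot", "omgili", "facebookbot", "diffbot", "bytespider",
    "imagesiftbot", "cohere-ai",
]


def build_parser(agents):
    """Rebuild one 'Disallow: /' group per agent and parse the result.

    Assumption: the original file uses one group per agent; only the
    parsed groups are visible in the report, not the raw file layout.
    """
    lines = []
    for agent in agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(AI_CRAWLER_GROUPS)

# Listed crawlers are blocked from every path.
blocked = parser.can_fetch("GPTBot", "https://local.infobel.pt/")

# A hypothetical crawler with no matching group is not blocked by these groups.
unlisted = parser.can_fetch("examplebot", "https://local.infobel.pt/")
```

This sketch models only the named groups. The `*` group is left out because `urllib.robotparser` compares rule paths by plain prefix and does not expand `*` inside a path.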

*

Rule Path
Disallow */Profile/GetDetailsReviews/*
Disallow */Search/GetUpperDetailsLinks*
Disallow */Search/GetBottomLinks*
Disallow */Search/GetUpperDetailsLinksAsync*
Disallow */Search/GetBottomLinksAsync*
Disallow */Search/LoadEmailModal*
Disallow */Search/LogRevealedPhone*
Disallow */Search/GetTopCompetitors*
Disallow */Search/GetSnapshotImage*
Disallow */Search/CategoryResults*
Disallow */Search/GetMediaFile*
Disallow */Search/LogWebsiteClick*
Disallow */Search/SetSorting*
Disallow */Search/GetQRCode*
Disallow */Search/GetDataLink*
Disallow */Search/BusinessResults*
Disallow */Search/BusinessDetails*
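
The rules in the `*` group rely on `*` wildcards inside the path. Under the Robots Exclusion Protocol (RFC 9309), `*` matches any run of characters and a trailing `$` anchors the end of the path. A minimal sketch of that matching follows, translating each rule to a regular expression. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical, and the sketch ignores `Allow` rules and longest-match precedence because this group lists only `Disallow` rules.

```python
import re

# Disallow rules of the "*" group, copied from the scan report above.
STAR_GROUP_RULES = [
    "*/Profile/GetDetailsReviews/*",
    "*/Search/GetUpperDetailsLinks*",
    "*/Search/GetBottomLinks*",
    "*/Search/GetUpperDetailsLinksAsync*",
    "*/Search/GetBottomLinksAsync*",
    "*/Search/LoadEmailModal*",
    "*/Search/LogRevealedPhone*",
    "*/Search/GetTopCompetitors*",
    "*/Search/GetSnapshotImage*",
    "*/Search/CategoryResults*",
    "*/Search/GetMediaFile*",
    "*/Search/LogWebsiteClick*",
    "*/Search/SetSorting*",
    "*/Search/GetQRCode*",
    "*/Search/GetDataLink*",
    "*/Search/BusinessResults*",
    "*/Search/BusinessDetails*",
]


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, starting at the path's first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path, rules=STAR_GROUP_RULES):
    """Return True if any Disallow rule matches the given URL path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# Hypothetical paths, chosen only to exercise the patterns.
qr_blocked = is_disallowed("/pt/Search/GetQRCode?id=1")
about_blocked = is_disallowed("/pt/about")
```

Because every rule starts with `*`, the listed endpoints are blocked under any path prefix, for example a language or locale segment.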