amp.tribuna.expresso.pt
robots.txt

Robots Exclusion Standard data for amp.tribuna.expresso.pt

Resource Scan

Scan Details

Site Domain amp.tribuna.expresso.pt
Base Domain expresso.pt
Scan Status Ok
Last Scan2024-11-03T07:33:42+00:00
Next Scan 2024-12-03T07:33:42+00:00

Last Scan

Scanned2024-11-03T07:33:42+00:00
URL https://amp.tribuna.expresso.pt/robots.txt
Domain IPs 2600:9000:269a:4000:a:942:5e80:93a1, 2600:9000:269a:7400:a:942:5e80:93a1, 2600:9000:269a:8e00:a:942:5e80:93a1, 2600:9000:269a:9200:a:942:5e80:93a1, 2600:9000:269a:a200:a:942:5e80:93a1, 2600:9000:269a:aa00:a:942:5e80:93a1, 2600:9000:269a:bc00:a:942:5e80:93a1, 2600:9000:269a:f800:a:942:5e80:93a1, 3.160.188.120, 3.160.188.123, 3.160.188.3, 3.160.188.73
Response IP 18.165.122.39
Found Yes
Hash d1ffb881a599d22ad83ec0530cd084756db6d63b6fbff0ba922f0e081d63cafa
SimHash 6903f7a1d6f1

Groups

googlebot

Rule Path
Disallow /ads-test.html
Allow /

googlebot-news

Rule Path
Disallow /ads-test.html
Allow /

googlebot-image

Rule Path
Disallow /ads-test.html
Allow /

googlebot-video

Rule Path
Disallow /ads-test.html
Allow /

mediapartners-google

Rule Path
Disallow /ads-test.html
Allow /

adsbot-google

Rule Path
Disallow /ads-test.html
Allow /

adsbot-google-mobile

Rule Path
Disallow /ads-test.html
Allow /

bingbot

Rule Path
Disallow /ads-test.html
Allow /

msnbot

Rule Path
Disallow /ads-test.html
Allow /

msnbot-media

Rule Path
Disallow /ads-test.html
Allow /

slurp

Rule Path
Disallow /ads-test.html
Allow /

duckduckbot

Rule Path
Disallow /ads-test.html
Allow /

facebookexternalhit

Rule Path
Disallow /ads-test.html
Allow /

facebot

Rule Path
Disallow /ads-test.html
Allow /

ia_archiver

Rule Path
Disallow /ads-test.html
Allow /

twitterbot

Rule Path
Disallow /ads-test.html
Allow /

linkedinbot

Rule Path
Disallow /ads-test.html
Allow /

rogerbot

Rule Path
Disallow /ads-test.html
Allow /

dotbot

Rule Path
Disallow /ads-test.html
Allow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap http://postal.pt/sitemap/news.xml