cinemacafe.net
robots.txt

Robots Exclusion Standard data for cinemacafe.net

Resource Scan

Scan Details

Site Domain cinemacafe.net
Base Domain cinemacafe.net
Scan Status Ok
Last Scan 2024-11-09T16:23:49+00:00
Next Scan 2024-11-16T16:23:49+00:00

Last Scan

Scanned 2024-11-09T16:23:49+00:00
URL https://cinemacafe.net/robots.txt
Redirect https://www.cinemacafe.net/robots.txt
Redirect Domain www.cinemacafe.net
Redirect Base cinemacafe.net
Domain IPs 211.14.31.65
Redirect IPs 211.14.31.65
Response IP 211.14.31.65
Found Yes
Hash 59d58a5f3059483d73b2d69842b7e192554b1d18a39eebdaea2d1c16c9ab48e8
SimHash 691d0910c1f3

Groups

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /test/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.cinemacafe.net/sitemap/cinemacafe.net-index.xml.gz
sitemap https://www.cinemacafe.net/movies/sitemap/index.xml
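The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text in it is reconstructed from this report, not copied from the site, so it will not match the recorded hash. Two points are assumptions: only two of the ten blocked crawlers are spelled out for brevity, and the crawl-delay record is attached to the bingbot group because it follows that group in the listing. The names RECONSTRUCTED_ROBOTS_TXT, build_parser, and examplebot are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the scan report, not the site's actual file.
# Only two of the ten fully blocked crawlers are listed to keep this short.
# Assumption: crawl-delay 5 belongs to the bingbot group.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: *
Disallow: /test/

User-agent: bingbot
Crawl-delay: 5

Sitemap: https://www.cinemacafe.net/sitemap/cinemacafe.net-index.xml.gz
Sitemap: https://www.cinemacafe.net/movies/sitemap/index.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser(RECONSTRUCTED_ROBOTS_TXT)
    base = "https://www.cinemacafe.net"
    # "examplebot" is a hypothetical crawler that falls under the "*" group.
    for agent, path in [
        ("GPTBot", "/"),
        ("examplebot", "/movies/"),
        ("examplebot", "/test/page"),
        ("bingbot", "/test/page"),
    ]:
        print(agent, path, parser.can_fetch(agent, base + path))
    print("bingbot crawl delay:", parser.crawl_delay("bingbot"))
    print("sitemaps:", parser.site_maps())  # site_maps() needs Python 3.8+
```

In this sketch bingbot matches its own group rather than the * group, so the /test/ rule does not apply to it, which is consistent with the "All paths allowed" note in the report.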