cupra.site
robots.txt

Robots Exclusion Standard data for cupra.site

Resource Scan

Scan Details

Site Domain cupra.site
Base Domain cupra.site
Scan Status Ok
Last Scan 2024-09-07T05:44:35+00:00
Next Scan 2024-10-07T05:44:35+00:00

Last Scan

Scanned 2024-09-07T05:44:35+00:00
URL https://cupra.site/robots.txt
Redirect https://sexalice.com/robots.txt
Redirect Domain sexalice.com
Redirect Base sexalice.com
Domain IPs 104.21.4.9, 172.67.131.113, 2606:4700:3034::ac43:8371, 2606:4700:3036::6815:409
Redirect IPs 104.21.49.168, 172.67.164.252, 2606:4700:3031::ac43:a4fc, 2606:4700:3035::6815:31a8
Response IP 172.67.164.252
Found Yes
Hash 50dd86fd430fb9fae44c9467a83ec48eed8ef962ee8274d60b7c3ca98b6373aa
SimHash 501cc84ac8b2
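
The Hash field above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched robots.txt body. The report does not name the algorithm, so the following is only a sketch under that assumption; the function name content_fingerprint and the sample bytes are hypothetical and are not the scanned file.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw response
    bytes. The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Illustrative bytes only, not the file that was scanned.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = content_fingerprint(sample)
print(len(digest))  # digest length in hex characters
```

Comparing this digest between two scans would show whether the file changed byte for byte. The SimHash field is a separate similarity fingerprint and is not reproduced here.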

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/

ia_archiver

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

bytedance

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /
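
The groups above can be checked with Python's standard urllib.robotparser. The sketch below rebuilds a robots.txt from the parsed groups; the exact text, ordering, and casing of the original file are assumptions, since the report lists only the parsed rules. The helper names build_robots_txt and make_parser are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User agents that the report lists with a single "Disallow /" rule.
BLOCKED_AGENTS = [
    "ia_archiver", "blexbot", "dotbot", "semrushbot", "mj12bot",
    "ahrefsbot", "ccbot", "chatgpt-user", "gptbot", "anthropic-ai",
    "claudebot", "omgilibot", "omgili", "facebookbot", "twitterbot",
    "diffbot", "bytespider", "bytedance", "imagesiftbot",
    "perplexitybot", "cohere-ai",
]


def build_robots_txt() -> str:
    """Rebuild a robots.txt from the parsed groups (reconstruction, not the original file)."""
    groups = [
        "User-agent: *\n"
        "Allow: /wp-admin/admin-ajax.php\n"
        "Disallow: /wp-admin/"
    ]
    groups += [f"User-agent: {agent}\nDisallow: /" for agent in BLOCKED_AGENTS]
    return "\n\n".join(groups) + "\n"


def make_parser() -> RobotFileParser:
    """Parse the reconstructed file without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


parser = make_parser()
# Generic crawler: only /wp-admin/ is off limits, except admin-ajax.php.
print(parser.can_fetch("ExampleCrawler", "https://sexalice.com/wp-admin/"))
print(parser.can_fetch("ExampleCrawler", "https://sexalice.com/wp-admin/admin-ajax.php"))
# Named crawler with its own group: everything is disallowed.
print(parser.can_fetch("GPTBot", "https://sexalice.com/"))
```

Under these assumptions the three checks print False, True, and False. Note that urllib.robotparser applies the first matching rule in a group rather than the longest match described in RFC 9309; the two approaches agree here because the Allow line precedes the broader Disallow line.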

Other Records

Field Value
sitemap https://sexalice.com/sitemap.xml