sflixto.net
robots.txt

Robots Exclusion Standard data for sflixto.net

Resource Scan

Scan Details

Site Domain sflixto.net
Base Domain sflixto.net
Scan Status Ok
Last Scan2025-12-14T05:22:12+00:00
Next Scan 2026-01-13T05:22:12+00:00

Last Scan

Scanned2025-12-14T05:22:12+00:00
URL https://sflixto.net/robots.txt
Domain IPs 104.21.24.121, 172.67.218.133, 2606:4700:3032::6815:1879, 2606:4700:3035::ac43:da85
Response IP 104.21.24.121
Found Yes
Hash 0ddacf08666257e4025fb620f8b27bce02a171eb39f76c2f080d7e89a53c05c4
SimHash 68501530fde6

Groups

*

Rule Path
Disallow /search
Allow /search/?
Disallow /acp/
Disallow /ucp/

gptbot

Rule Path
Disallow /actor/
Disallow /director/

facebot
bingbot
bingpreview
twitterbot
twitterbot

Rule Path
Allow /imgres

facebookexternalhit

Rule Path
Allow /imgres

Other Records

Field Value
sitemap https://sflixto.net/sitemap.xml

Comments

  • Crawlers of certain social media sites are allowed to access page markup when google.com/imgres* links are shared. To learn more, please contact images-robots-allowlist@google.com.

Warnings

  • 2 invalid lines.