3 Commits (9e411073e901f766118a7b82f613872fd745ecc2)

Author SHA1 Message Date
comfyanonymous 174eba8e95 Use own clip vision model implementation. 1 year ago
comfyanonymous efb704c758 Support attention masking in CLIP implementation. 1 year ago
comfyanonymous fbdb14d4c4 Cleaner CLIP text encoder implementation.
Use a simple CLIP model implementation instead of the one from
transformers.

This will allow some interesting things that would be too hackish to implement
using the transformers implementation.
1 year ago