3 Commits (d35267e85a865c30a5fa63fdb0a21f94f4cc37e7)

Author SHA1 Message Date
comfyanonymous 174eba8e95 Use own CLIP vision model implementation. 1 year ago
comfyanonymous efb704c758 Support attention masking in CLIP implementation. 1 year ago
comfyanonymous fbdb14d4c4 Cleaner CLIP text encoder implementation.
Use a simple CLIP model implementation instead of the one from
transformers.

This will allow some interesting things that would be too hackish to implement
using the transformers implementation.
1 year ago