3 Commits (392878a2621d131ac9e856fb2d428d9c6e2a022e)

Author SHA1 Message Date
comfyanonymous 174eba8e95 Use own clip vision model implementation. 1 year ago
comfyanonymous efb704c758 Support attention masking in CLIP implementation. 1 year ago
comfyanonymous fbdb14d4c4 Cleaner CLIP text encoder implementation.
Use a simple CLIP model implementation instead of the one from
transformers.

This will allow some interesting things that would too hackish to implement
using the transformers implementation.
1 year ago