Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Microsoft NYU UCLA NeurIPS | DSAI by Dr. Osbert Tay | Podwise