My research interests lie in the area of Computer Vision and Machine Learning. I am especially interested in leveraging knowledge from foundation model to augment the capabilities of autonomous agents, which can further liberate individuals from mundane and repetitive tasks. My ultimate goal is to build an intelligent agent that can help us explore the world and expand our boundaries of perception, decision and action.
We introduce token-wise consistency terms between the image content and object segmentation maps in training text-to-image models for enhanced multi-category instance composition and photorealism.
We propose OmniControlNet, a dual-stage integration framework for conditional image generation, which could effectively generate different controls and corresponding images in a unified framework.
Implement the PBRS algorithm specifically tailored for the Unitree H1 Robot. Enable the H1 Robot to walk following given linear velocity and angular velocity commands without falling.
This is the source code from Jon Barron. Thanks to him for sharing this beautiful template!