A framework to enable multimodal models to operate a computer. Using the same inputs and outputs as a human operator, the model views the screen and decides on a series of mouse and keyboard actions ...
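The view-then-act cycle described above can be sketched as a simple loop. This is only an illustration under stated assumptions, not the framework's actual code: the names `Action`, `run_operator_loop`, `capture_screen`, `decide`, and `execute` are hypothetical, and the screenshot source, the model, and the input driver are replaced with stand-ins so the example runs on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Action:
    """Hypothetical action record: kind is "click", "type", or "done"."""
    kind: str
    payload: dict = field(default_factory=dict)


def run_operator_loop(
    capture_screen: Callable[[], bytes],
    decide: Callable[[bytes, List[Action]], Action],
    execute: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Repeatedly view the screen, ask the model for an action, and perform it."""
    history: List[Action] = []
    for _ in range(max_steps):
        screenshot = capture_screen()         # same input a human operator sees
        action = decide(screenshot, history)  # model picks the next action
        history.append(action)
        if action.kind == "done":             # model signals the task is finished
            break
        execute(action)                       # mouse or keyboard output
    return history


# Stand-ins (assumptions): a fake screen, a scripted "model", a recording executor.
def fake_capture() -> bytes:
    return b"fake-pixels"


scripted = iter([
    Action("click", {"x": 10, "y": 20}),
    Action("type", {"text": "hello"}),
    Action("done"),
])


def fake_decide(screenshot: bytes, history: List[Action]) -> Action:
    return next(scripted)


executed: List[Action] = []
history = run_operator_loop(fake_capture, fake_decide, executed.append)
```

In a real setup the three callables would wrap a screenshot utility, a multimodal model call, and an input-automation library. The `max_steps` cap is a design choice in this sketch that keeps a confused model from acting indefinitely.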