__ingeniare__ t1_jdhxcds wrote
Reply to comment by BinarySplit in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
I would think that image segmentation of a UI, to identify clickable elements and the like, is a very solvable task.