Meta AI Unleashes Megabyte, a Revolutionary Scalable Model Architecture
Created on 2023-05-24T17:31:42-05:00
Meta AI's Megabyte splits an input sequence into fixed-size patches of tokens (bytes, in Megabyte's case) and processes each patch with a small local transformer. A larger global transformer spans all patches and ties the independently processed patches together, so that the output is coherent as a whole.
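
The split-then-combine flow described above can be illustrated with a toy sketch. This is not Meta's implementation: the names `split_into_patches`, `local_model`, `global_model`, and the patch size of 4 are all assumptions made for illustration, and the two "models" are simple arithmetic stand-ins rather than transformers.

```python
from typing import List

PATCH_SIZE = 4  # assumed patch length, chosen only for illustration


def split_into_patches(data: bytes, patch_size: int = PATCH_SIZE, pad: int = 0) -> List[bytes]:
    """Split a byte sequence into fixed-size patches, padding the last one."""
    if patch_size <= 0:
        raise ValueError("patch_size must be positive")
    remainder = len(data) % patch_size
    if remainder:
        data = data + bytes([pad]) * (patch_size - remainder)
    return [data[i:i + patch_size] for i in range(0, len(data), patch_size)]


def local_model(patch: bytes) -> float:
    """Hypothetical stand-in for the per-patch network: the mean byte value."""
    return sum(patch) / len(patch)


def global_model(patch_summaries: List[float]) -> List[float]:
    """Hypothetical stand-in for the cross-patch network: a running average,
    so each output depends on the current patch and all earlier ones."""
    outputs: List[float] = []
    total = 0.0
    for count, summary in enumerate(patch_summaries, start=1):
        total += summary
        outputs.append(total / count)
    return outputs


def process(data: bytes) -> List[float]:
    patches = split_into_patches(data)
    # Each patch is handled independently of the others.
    summaries = [local_model(p) for p in patches]
    # The per-patch results are then combined across the whole sequence.
    return global_model(summaries)
```

Because each call to `local_model` sees only one patch, the per-patch work could in principle run in parallel; only the `global_model` step looks across the whole sequence.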