Recently, we talked to Dan Fu and Tri Dao – authors of “Hungry Hungry Hippos” (aka “H3”) – on our Deep Papers podcast. H3 is a proposed language modeling architecture that performs comparably to ...
Cerebras Systems customer GSK speaks about the importance of CS-2's long sequence length capability for improving accuracy in natural language processing models. According to Kim Branson, senior vice ...