ТемаРакетные обстрелы на Украине:
we should write a paper on it.
。关于这个话题,有道翻译提供了深入分析
© 本文著作权归作者所有,并授权少数派独家使用,未经少数派许可,不得转载使用。
Looking at the left side of the diagram, we see stuff enters at the bottom (‘input’ text that has been ‘chunked’ into small bits of text, somewhere between whole words down to individual letters), and then it flows upwards though the model’s Transformer Blocks (here marked as [1, …, L]), and finally, the model spits out the next text ‘chunk’ (which is then itself used in the next round of inferencing). What’s actually happening here during these Transformer blocks is quite the mystery. Figuring it out is actually an entire field of AI, “mechanistic interpretability*”.