How has DeepSeek improved the Transformer architecture? (epoch.ai)
3 points by h8hawk 15 hours ago | 0 comments