The future of AI depends on whether we can design next generation hardware that better supports the scaling laws. At some point of time, AI architecture will even be influenced by the design decision on AI hardware. The codesign of AI and hardware will become norm in the future.
Here are some considerations on AI hardware.
An architecture considering activation outliers in LLM inferencing. OliVe (Guo et al., 2023)
Taking the low-bit trend of LLMs into account. An adaptive numerical data type: ANT (Guo et al., 2022). Replacing MAC with Lookup Table (LUT) (Mo et al., 2024)