There are many possible future architectural changes that could help. Some are more feasible than others but all require fundamental advancement to be useful and cost effective.
Some of them include:
- GPU - CPU shared memory (already possible just not yet the standard)
- Higher DRAM bandwidth (already possible, just not yet a priority)
- system on chip FPGAs (always possible just very expensive to fit “AI models”)
- SOC NVM. Ideally even NVM on the same wafer as the GPU and CPU (possible today but requires a lot of work on the yield. NVM would take up a lot of real estate that could ruin yield).
Some of them include:
- GPU - CPU shared memory (already possible just not yet the standard)
- Higher DRAM bandwidth (already possible, just not yet a priority)
- system on chip FPGAs (always possible just very expensive to fit “AI models”)
- SOC NVM. Ideally even NVM on the same wafer as the GPU and CPU (possible today but requires a lot of work on the yield. NVM would take up a lot of real estate that could ruin yield).
- analog circuits
- new semi-conductors / photonics
- memristors