The Decadal Plan for Semiconductors is a call for innovative solutions in semiconductor design and manufacturing to suit the growing needs of information and communication. It identified five critical problems that could be of further research while reaching the end of Moore’s law.

The Decadal Plan website can be found here: https://www.src.org/about/decadal-plan/.

Analog Hardware

Real-world sensory inputs are analog; modern AI/robotic systems absorb outside information and perform calculations on them. In the past, computation (compute per dollar) grows exponentially with the exponential growth of data, so we are not having much trouble. However, with the end of Moore’s law, the plethora of information is almost certainly guaranteed to obliterate the computation power we have.

The human brain does not work with a lot of data. Instead, it focuses on what is important, known as the attention mechanism as understood by psychology and has been exploited by machine learning algorithms. This mechanism has made machine learning algorithms perform much better, but such attention pruning happens after all data is collected and stored, which consumes lots of bandwidth, storage, and computation.

This section has proposed that attention pruning can be introduced at the point when data is collected with analog computation. Analog computation has the advantage of low power consumption. They can also provide more computation power when used to operate on multi-bit analog data or massive parallelism when used together with things such as in-memory computing. More importantly, analog computation fits naturally when the sensory data is inherently analog.

Analog computation was nothing new. It began around the same as digital computing. Simple adders and multiplexers can be made out of operational amplifiers and transistors. However, they soon fell out of favor because the result is usually not accurate due to interference and noise. This issue grows heavier as we move to higher-resolution numbers, such as 32-bit and 64-bit integers and floating numbers. Analog computing needs some hard work to get to 16-bit resolution, and practically impossible, as seen today on 32-bits. With the growth of machine learning, however, accuracy gets less critical in recent machine learning computations: Some models even work with 8-bit or 4-bit resolution numbers, and we saw analog in-memory computing modules performing batch matrix-vector multiplications.

This agenda proposes more investment in analog computing to cope with the growing amount of data and communication needs.

New Memory and Storage Solutions

The idea stated in the previous section may reduce the amount of data we need to save, but before that is fully realized, considering data scientists will still need the full amount of data to study them (such as studying how to prune them,) the demand for more storage is not going anywhere soon.

This idea seems less imminent since storage technology is still scaling: DDRs are getting denser with more levels of trenches, SSDs are getting higher bit densities with 3D stacking, and even hard drives are getting a boost with long-advertised HAMR technologies (that has yet to deliver, though, after all these years …) However, as calculated by this document, if we want to store all the data we produce on NAND flashes, the amount of silicon we need will outrun the total silicon supply. I do not fully agree with this point: Not all data need to be stored, such as Instagram stories or surveillance camera footage. For cold data, we have alternate storage solutions such as hard drives or even tapes.

The document then called for innovative and efficient memory that is fast, dense, and cost-effective. There is also the need for memory to support emerging architectures such as quantum computers. Unfortunately, we did not see any of the current solutions providing such a benefit. DNA storage, for example, provides high density and may get cost-effective in the future (not much knowledge in that area), but it is very slow and prone to data loss.

The final point to point out is that this has less to do with Moore’s law.

Communication Capacity

Communication comes right after storage. Data need to be transferred to where they need to be stored and delivered to where they are wanted. More data means more communication. This will not only lead to a need for higher bandwidth but also the need for a way to transfer data in an energy-efficient way.

The first way out is obvious: Edge computing and solution, so the data does not need to be moved that far and that often. This leads to more topics in network routing and discovery, distributed computation algorithms, and information security challenges.

The second evident approach is to increase the bandwidth. The total worldwide bandwidth is continuing to grow. Though it might become hard to increase the bandwidth over a single link, most network applications are scale-out network systems designed to be more and more distributed, load-balanced, and redundant, so that is not really a problem.

One final point they mentioned is the advancement in wireless communication. Wireless performance can be improved by increasing the frequency (all the way to the THz range) as well as increasing the number of antennas. I am not a great fan of such ideas: Wireless communication is costly compared to wires communication in terms of money and energy, they are not as reliable, they may have security problems, and they may cause electromagnetic interference issues. 5G is already overkill for most mobile phone users. Industrial wireless solutions exist, but more of them focus on reliability or reachability, not bandwidth or speed.

Security

The fourth challenge is security. Complex systems exhibit more weak points or explorable subsystems, such as cache, prefetching, and speculative execution. But more importantly, the prevalence of AI and computing has brought more problems.

AIs are not trustworthy. They have no guarantee of correctly identifying examples they have not seen before, and they could be deceived by a couple of stickers and think a traffic sign is something the attacker wants it to see and completely different. More intelligent security solutions might be needed for such attacks.

The migration to cloud computing is also posing a risk to security. A multitenant data center environment has lots of unknown actors on the same network, and sometimes you do not even want to trust the cloud provider. We have no way of making sure the calculation results from the cloud server are correct or have not been tampered with if we do not perform some cryptographic verification. Several security enclaving techniques and trusted execution environments have been proposed, but few of them stayed secure with side channels and various other kinds of attacks.

Security is a big, big topic, and it is never a new problem, but the problem does not seem to grow bigger with the end of Moore’s law, yet we still have to keep it in mind.

Energy-Efficient Computing

Dennard scaling ended long before the coming end of Moore’s law. If we want more calculations, we have to pay more energy, and the amount of energy paid to calculation is already a significant amount of energy generation we generate. If we want to have more computation, with the current level of energy consumption, we may soon run to the cap.

The report proposes to alter the trajectory of computing and find completely different solutions, yet it is completely unknown what solution might be out there. Quantum computing is an entirely different architecture, and we do not really have an idea how much energy they would consume once they are fully placed into applications.

Conclusion

The Decadal Plan for Semiconductors is a document guiding where the silicon industry should proceed, reaching the end of Moore’s law. We should seek to either grow computation without relying on Moore’s law or strategically perform less computation to achieve our needs. Moreover, there are lots of challenges above Moore’s law that we also need to pay attention to.

References

[1]
N. Verma et al., “In-Memory Computing: Advances and Prospects,” IEEE Solid-State Circuits Magazine, vol. 11, no. 3, pp. 43–55, 2019, doi: 10.1109/MSSC.2019.2922889
[1]
K. Eykholt et al., “Robust Physical-World Attacks on Deep Learning Visual Classification,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun. 2018, pp. 1625–1634. doi: 10.1109/CVPR.2018.00175
[1]
J. Ang et al., “Decadal Plan for Semiconductors - Abridged Report”, Available: https://www.src.org/about/decadal-plan/
[1]
International Telecommunication Union, “International bandwidth usage 2021.” Available: https://www.itu.int/itu-d/reports/statistics/2021/11/15/international-bandwidth-usage. [Accessed: May 14, 2023]
[1]
P. Kocher et al., “Spectre Attacks: Exploiting Speculative Execution,” in 2019 IEEE Symposium on Security and Privacy (SP), May 2019, pp. 1–19. doi: 10.1109/SP.2019.00002

Leave a Reply

Your email address will not be published. Required fields are marked *