Issue
Natl Sci Open
Volume 2, Number 4, 2023
Special Topic: Two-dimensional Materials and Devices
Article Number 20220071
Number of page(s) 11
Section Information Sciences
DOI https://doi.org/10.1360/nso/20220071
Published online 29 June 2023

© The Author(s) 2023. Published by China Science Publishing & Media Ltd. and EDP Sciences.

Licence Creative CommonsThis is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

INTRODUCTION

In recent years, with the rapid development of artificial intelligence (AI) and deep learning, primary advances in computing power are required to complete massive computational tasks more efficiently [1]. Currently, computing systems are built based on the von Neumann architecture, where the processing unit and memory unit are separated. The latency associated with accessing data from the memory unit is the key performance bottleneck for the increasingly prominent AI related workloads [2]. In-memory computing is an alternative approach in which the memory units themselves are organized to process certain computational tasks [3,4]. The removal of data transmission between the processing unit and memory unit can significantly improve the computing performance and effectively avoid the energy cost of data transmission [5]. Various types of memory devices have been investigated for accomplishing in-memory computing, such as dynamic random-access memory (DRAM) [610], flash [1113], resistive random-access memory (RRAM) [1416], and phase change memory (PCM) [17,18]. Each memory device has key features for in-memory computing. Flash has the ability to implement multi-bit storage by applying different write voltages, but its endurance is relatively limited to ~105 and requires a high write voltage. RRAM and PCM are based on an atomic rearrangement mechanism for conductance switching, which has a large resistance window to store multiple intermediate states [14,19]. It should be noted that the intermediate resistive state of the PCM drifts over time toward high resistance, and RRAM has problems with its electrical characteristics varying significantly not only between devices but also between programming cycles of the same device [3].

Theoretically, DRAM has almost unlimited cycling endurance (>1016), which allows for in situ training in neural networks [20]. However, conventional one-transistor-one-capacitor (1T1C) DRAM has a destructive read operation and is a type of binary memory [21], which limits the application of DRAM in the field of in-memory computing. The two-transistor-one-capacitor (2T1C) DRAM structure has been reported previously to fulfill the matrix-vector multiplication (MVM) operation in artificial neural networks (ANN), offering significant reduction in device area and energy consumption [22,23]. However, the conventional approach to achieving multi-bit operations typically involves employing several units that store binary weights through shift operations [24]. The distinct van der Waals characteristics of 2D materials enable us to achieve heterogeneous integration and exceptional device performance [25]. In this study, an alternative approach of utilizing a single unit for multi-bit storage and calculation was introduced. We demonstrated a heterogeneous integrated 2T1C DRAM device based on monolayer molybdenum disulfide (MoS2) and graphene. The ultralow leakage current of the MoS2 field-effect transistor (FET) permitted slow charge leakage, exhibiting a long retention time, and allowing for the retention of multi-level voltage in the 2T1C DRAM. The graphene transistor served as the readout terminal and performed a non-destructive readout of the voltage stored on the capacitor. The channel conductance of graphene transistors can be controlled by the gate voltage, enabling a linear adjustment of the output current. Additionally, the channel displays resistive characteristics that are linearly correlated with the source-drain voltage [26]. The outstanding linear output and transfer characteristics of graphene FET enabled the realization of multi-bit analog multiplication operation on a single unit.

RESULTS AND DISCUSSION

Figure 1A shows a schematic fabrication process of our 2T1C DRAM and an optical microscope image. The equivalent circuit diagram is shown in Figure 1B. The drain and gate terminals of the MoS2 transistor (T1) connect to the weight line voltage Vw and the refresh line voltage Vre, respectively. The source of T1 connects to the bottom plate of the capacitor (C) and shares the potential with the back gate of the graphene transistor (T2) simultaneously. The input voltage Vx is applied to the drain of T2. The top-plate of C and the source of T2 are grounded.

thumbnail Figure 1

Structure of two-transistor-one-capacitor (2T1C) dynamic random-access memory (DRAM) cell and electrical characteristics of MoS2 and graphene transistors. (A) Schematic of our 2T1C DRAM cell fabrication process and optical microscope image of the fabricated device; scale bar: 100 μm. (B) Circuit diagram of 2T1C DRAM containing storage and calculation modules. (C) Transfer curve of the MoS2 field-effect transistor (FET). (D) Transfer and (E) output curves of the graphene FET.

Figure 1C presents the transfer curve of the MoS2 FET. The gate voltage Vg varied from −3 to 3 V, and the applied drain-source voltage was 0.5 V. The MoS2 FET exhibited a high current on/off ratio (107) associated with its ultralow leakage current (156 fA/μm) at Vg = −3 V, which is correlated with a longer retention time of the DRAM and will be discussed later. On the other hand, the graphene FET exhibits a distinct bipolar behavior. As can be seen from the transfer curve of the graphene FET (Figure 1D), the Dirac point is located at Vg = −0.25 V with a current of 1.8 μA/μm, and a maximum current of 5.3 μA/μm was observed at Vg = 3 V. The transfer characteristics of the nine graphene transistors in the array were shown in Figure S1, demonstrating stable Dirac points and good device uniformity. Figure 1E presents the corresponding output characteristics of the graphene FET. Different from the saturation behavior observed in most MoS2 FETs, the graphene FET shows ideal linearity in the range of 0 to 0.5 V, and the output curves can be clearly distinguished with Vg varying from 0 to 2 V. As shown in Figure S2, the dash lines are obtained by linear fitting of the transfer curve and the output curves with coefficients of determination R2 as high as 0.990 and 0.9996, respectively. As Vg varied from 0 to 2 V, the conductance of the graphene transistor increases linearly from 0.6 to 1.3 mS. At low Vd (from 0 to 0.5 V), the channel of the graphene transistor exhibits a resistive characteristic and Id has ideal linearity (see Supplementary information). Based on the C-V curves shown in Figure S3, the capacitance of the flat capacitor is 30 pF at a sweep frequency of 1 kHz. Furthermore, the parasitic capacitance of T2 is only 1.4 pF, accounting for a mere 4.5% of the overall capacitance of the device.

To write data into the 2T1C DRAM, a Vw pulse with a width of 900 ms was input 50 ms earlier than the Vre pulse with a width of 800 ms, as shown in Figures 2A and 2B. During the programming operation, the pulse voltage of Vw was 2 V with a base level of 0 V. When Vre increased from −3 to 3 V, T1 was switched to an on state, and the capacitor was charged to 2 V, the same as Vw. Then, Vre was set back to −3 V and T1 was switched to the off state so that the charge in the capacitor could be retained. As shown in Figure 2C, Vc was 1.94 V when the Vre pulse was withdrawn (0 s). After holding for 9 s, Vc was at a high level of 1.8 V, corresponding to a voltage loss of only 7%. During the erasing operation, Vre was applied to turn on T1 while Vw was maintained at 0 V, then the charge in the capacitor was erased and the voltage potential on the capacitor dropped to a low level. We also demonstrated the fast program/erase speed of Tprogram = 640 μs and Terase = 650 μs as shown in Figure S4.

thumbnail Figure 2

Dynamic programming and erasing operation. Input signals of Vre and Vw in (A) programming and (B) erasing operations. Retention characteristics (C) Vc and (D) Id over 9 s after programming/erasing operations. (E) Dynamic writing and reading by sequential programming and erasing pulses with 9 s interval.

The in situ readout of the data stored in the DRAM was realized with a fixed Vx of 0.5 V, and the output current of T2 (Id) is shown in Figure 3D. Since the non-ground terminal of the capacitor was connected to the gate of T2, Vc also acted as the gate voltage of T2. Under a programming state, Vc remained at a high voltage potential and Id retained a high-current level (~4.8 μA/μm). Conversely, in the erased state, Id retained a low-current level (~2.7 μA/μm). The ratio of the high current to the low current was 1.78 at the beginning and could be maintained at 1.70 after 9 s. The longest retention time of our 2T1C DRAM is 70 s with 10% drop in Imax corroborating the excellent retention performance as shown in Figure S4.

thumbnail Figure 3

Characterization of the 2T1C unit cell. (A) Incremental step programming pulse (ISPP) waveform of Vw, varying from 0.5 to 2 V. Input pulse waveform of Vre. (B) Drain current Id of T2 with Vw varying from 0.5 to 2 V, where Vx = 0.1 V. (C) ISPP waveform applied to Vx, ranging from 0.1 to 0.4 V with 0.1 V increments. The pulse width was 0.1 s. (D) Corresponding Id curves when Vw was fixed at a series of values.

We then demonstrated a continuous writing and reading operation, as shown in Figure 2E. During the 100 s test period, the pulse width of the control signal Vre was 800 ms with a period interval of 10 s. Vw signal had a 2 V pulse with a 900 ms time width, with a period of 20 s. With a fixed Vx = 0.5 V, Id can be switched between high and low current levels, corresponding to programmed and erased states, respectively. In contrast to the traditional silicon 1T1C DRAM, a destruction to the stored data during the reading operation can be avoided in our 2T1C MoS2 DRAM, benefiting from such non-destructive readout characteristic. Thus our MoS2 2T1C structure circuit can realize a continuous non-destructive reading, which is more suitable for in-memory computing applications [27].

We then characterized the properties of the 2T1C storage module. Figure 3A shows the input pulse signals of Vw and Vre during the storage operation. T1 was turned on by applying an active Vre (3 V), then Vw was applied to charge the capacitor. After the storage operation, T1 was then turned off by applying a negative Vre (−3 V), and Vw was set to 0 V. Figure 3B shows the measured Id-t curves, in which Vx was fixed at 0.1 V, and the Vw pulse increased from 0.5 to 2 V with a 0.5 V increment. Then, Vx was fixed for each Id-t curve to monitor the current Id over one period. After holding for 9 s, different Id-t curves could still be clearly distinguished.

Thus, when Vw is stored in the capacitor as a weight value, it also controlled the channel conductance of T2 as a gate input voltage. Thanks to the excellent linearity and wide current range in the graphene transistor T2, it has a more distinguishable output than other nonlinear devices (Figures 1D and 1E), therefore our 2T1C DRAM could also realize multi-level storage and computing without complex linear modulation circuits [28]. Figure 3C shows a multi-step voltage waveform for Vx, which ranged from 0 to 0.4 V with a 50 mV increment and duration of 0.5 s. By applying such incremental step programming pulses (ISPPs) for Vx, Figure 3D shows Id-t curves with five levels. Meanwhile, five levels of Vw were also applied from 0 to 2 V with an increment of 0.5 V, corresponding to the curves with different colors in Figure 3D. The above results can be summarized as, the output current Id for different Vw values could be distinguished under the same Vx, and the current values for different Vx values could also be distinguished for the same Vw. The dynamic measurement for more than 2 bits was demonstrated in Figure S5. The results in Figure S5a showed that Id is separated into 8 levels and can still be effectively distinguished after 9 s. This confirms that our device has the ability of 3-bit storage. The dynamic measurement results demonstrate that our 2T1C DRAM has the ability to realize 6 bits multiplication.

Multiplication is the most essential computational function in an FNN. Combining the long retention time of our 2T1C DRAM and the linear characteristic of T2, in-memory multiplication can be implemented. We note that the current did not converge to zero when both the weight voltage Vw and the input voltage Vx were 0 V due to the intrinsic semimetallic behavior of the graphene whose conductance could not be completely turned off. Therefore, a current differential circuit was designed for the 2T1C DRAM, as shown in Figure 4A. The current Id of T2 is the input to the current differential module, which subtracts the differential current I0 to obtain the output current Iout. The detailed circuit diagram of the current differential is shown on the lower left of Figure 4A. The current differential is composed of three parts, in which the red box is a current mirror to duplicate the input current (Id = I′d), while the transistors in the blue and green boxes (T3 and T4) are connected in parallel. The current differential circuit is heterogeneously integrated by two channel materials. The current mirror is composed of two MoS2 transistors of the same size, while the graphene transistors T3 and T4 provide the differential current and output current, respectively. The same as T2, and the drain of T3 and T4 are all connected to the input voltage Vx. The channel width-to-length ratio of T3 is half that of T2, and the gate of T3 is connected to the gate of T2 to ensure that T3 and T4 obtain the same drain-to-source voltage Vd as that of T2 in the working range of Vw (from 0 to 2 V). T4 is exactly equivalent to T2, and its gate is applied with a voltage of 0 V to offer a differential current I0 of the graphene transistor at Vg = 0 V. According to Kirchhoff’s current law, the output current through T3 is Iout = IdI0.

thumbnail Figure 4

Implementation of multiplication operation. (A) Circuit diagram of current differential designed for the 2T1C DRAM. (B) Simulated Iout values as a function of the calculated Vw × Vx values. (C) Colormap of standard multiplication operation. (D) Colormap of Iout as a function of Vw and Vx.

Figure 4B shows the simulated Iout values as a function of their corresponding Vw × Vx values. Iout exhibited a linear dependence on Vw × Vx. We plotted the two-dimensional contour map of the standard multiplication operation (Z = X × Y) as shown in Figure 4C. The values of X and Y ranged from 0 to 0.5 and 0 to 2, respectively. The value of Z increased with the increase in X and Y, and it reached the maximum value at X = 2 and Y = 2. A two-dimensional colormap of Iout with different applied Vx and Vw values is shown in Figure 4D. The variation of Z with X and Y showed a practically identical trend to the variation of Iout with Vx and Vw. Thus, we have experimentally verified that the analog multiplication operation could be performed with our 2T1C DRAM. Taking advantage of the high on/off ratio of MoS2 transistors (107) and the linear output and transfer characteristics of graphene transistors described above, our 2T1C DRAM can simultaneously implement 2-bit memory with a retention time of more than 9 s as well as analog multiplication operations. Thus, in one retention period, the multiplication operations could be achieved by applying a set of short Vx pulses to the drain of the T2 transistor, using the stored voltage Vw and the Vx pulses.

Thus, we have experimentally demonstrated long-time retention and the multi-bit storage capability of our 2T1C DRAM with non-destructive access, and the graphene transistor could be used to accomplish computing. Owing to the ideal cycling endurance of the DRAM structure, our 2T1C DRAM has the potential to be used for in situ training and recognition, rather than off-chip training, followed by mapping the weights to the device, which can significantly improve the recognition accuracy of the neural network. In addition, 2D layered semiconductors allow the stacking of materials with different properties together [29,30], which enables the monolithic integration of the 2T1C DRAM based on MoS2 and graphene transistors. Therefore, our study supports the application of 2D layered semiconductors for future post-Moore applications.

Finally, we constructed a three-layer FNN to implement letter recognition. As shown in Figure 5A, the input layer contained 25 neurons corresponding to the gray values of 5 × 5 pixel images. The standard images of the letters to be recognized were “F,” “D,” and “U,” as shown on the left of Figure 5A. The input images of the FNN were obtained by adding 20% random noise to the standard gray values of the letters. The output layer contained three neurons corresponding to the output values of “F,” “D,” and “U,” and the hidden layer contained 9 neurons. The input and hidden layers were connected by 25 × 9 synapses denoted as a weight matrix . The hidden and output layers were connected by 9 × 3 synapses denoted as a weight matrix . Each synapse corresponded to one 2T1C DRAM cell, which consisted of five-level weights. The circuit diagram of our 2T1C DRAM simulating the m × n synapse array is shown in Figure 5B. This array stored an m × n weight matrix, which was written through bit lines under the control of refresh lines. The output current value of each line is

thumbnail Figure 5

Fully connected neural network (FNN) for letter recognition. (A) Schematic diagram of FNN for letter recognition. The 5 × 5 pixel image of the letters “F,” “D,” and “U.” The grayscale values of the pixels were the inputs of the FNN. (B) Circuit diagram of an m × n array composed of 2T1C DRAM cells. (C) Recognition rate as a function of the training epochs. The inset shows the distribution of the 25 × 9 weights after 20 training epochs. (D) Colormaps of weights before and after training. These five-level weights represent the weight voltages Vw written to the 2T1C DRAM cells.

where is the sum of the current values of the n devices in each line. I0 represents the differential current, which depends on the value of n.

We then trained our simulation model with 1000 images, while another 1000 images were used as the test set. During the training process, the backpropagation algorithm was applied to train the model [31]. As shown in Figure 5C, the average recognition accuracy reached 84% within 20 training epochs. The weight values were quantized into 5 weight voltage levels which were the Vw written to the 2T1C DRAM. The inset of Figure 5C shows the weight voltages distribution of the 25 × 9 weights after 20 training epochs. Figure 5D shows a colormap of the weight voltages before and after training.

CONCLUSION

In conclusion, we utilized a 2T1C DRAM structure consisting of MoS2 and graphene heterogeneous integration to accomplish a multi-bit voltage storage function. Owing to the low leakage current of the MoS2 transistor, the stored electric charge could be retained for a long time, and meanwhile accurately modulate the conductance levels of the graphene transistor. Because of the linear characteristic of the graphene transistor’s output curve, analog multiplication could be successfully implemented. In contrast to the conventional 1T1C DRAM, our 2T1C DRAM could access the stored multi-bit information nondestructively. Finally, a 25 × 9 × 3 FNN was built based on our 2T1C DRAM, and the average recognition accuracy of the letters “F,” “D,” and “U” reached 84% within 20 training epochs. Considering the almost unlimited cycling endurance of DRAM, our 2T1C DRAM structure exhibits remarkable potential for in-memory computing.

METHODS

Device fabrication

As shown in Figure 1A, wafer-scale monolayer MoS2 grown on a sapphire substrate using the chemical vapor deposition (CVD) method was used as the channel material of the T1 transistor. The contact electrodes of T1, the bottom plate of the capacitor, and the back gate of T2 were patterned by photolithography, followed by depositing a 40 nm Au layer using electron beam evaporation (EBE). After this, the excess MoS2 beyond the channel area was etched by inductively coupled plasma (ICP). A 16 nm HfO2 high-k dielectric layer was grown by atomic layer deposition (ALD). The wafer-scale monolayer graphene grown by CVD was then transferred onto the dielectric layer as the channel of T2. The 40 nm Au top gate electrode of T1, the top plate of the capacitor, and the contact electrodes of T2 were fabricated by photolithography and formed by resistive thermal evaporation.

Electrical measurements

All measurements were performed in an ambient environment at room temperature. The MoS2 and graphene FETs were characterized using a semiconductor parameter analyzer (Agilent B1500A). For dynamic measurements and characterization of 2T1C DRAM cell, the Agilent B1500A was used to provide voltage signals and detect the output current. A waveform generator (Aligent 33260A) was also used to supply voltage waveforms to the test circuit, while an oscilloscope (DS 1054Z) was used for capturing output signal voltage.

Neural network implementation

In FNN simulation, the training process consisted of forward and backpropagation. The output values of the hidden layer are , where xi is one of the pixel grayscale values of the input image, is the weight value between input neuron i and hidden neuron j, and refers to the bias of the hidden neuron. During backpropagation operation, the weight values between input layer and hidden layer were updated by , where the η=0.01, and is the calculated error.

Data availability

The original data are available from corresponding authors upon reasonable request.

Funding

This work was supported by the National Key Research and Development Program (2021YFA1200500), in part by the Innovation Program of Shanghai Municipal Education Commission (2021-01-07-00-07-E00077), and Shanghai Municipal Science and Technology Commission (21DZ1100900).

Author contributions

W.B., P.Z. and Y.X. initiated and supervised the project. S.G. and Y.W. designed the experiments and analyzed the data. X.D. contributed to the idea of the current differential circuit. X.W. and Q.S. were involved in device fabrications and electrical measurements. S.G. and Y.W. wrote the manuscript. All authors read and contributed to the manuscript revision.

Conflict of interest

The authors declare no conflict of interest.

Supplementary information

The supporting information is available online at https://doi.org/10.1360/nso/20220071. The supporting materials are published as submitted, without typesetting or editing. The responsibility for scientific accuracy and content remains entirely with the authors.

References

  • LeCun Y, Bengio Y, Hinton G. Deep learning. Nature 2015; 521: 436–444 [Google Scholar]
  • Wulf W, McKee S. Hitting the memory wall: implications of the obvious. Comput Architect News 1996; 23: 20–24 [Google Scholar]
  • Wong HSP, Salahuddin S. Memory leads the way to better computing. Nat Nanotech 2015; 10: 191-194. [Article] [NASA ADS] [CrossRef] [Google Scholar]
  • Sun X, Zhu C, Yi J, et al. Reconfigurable logic-in-memory architectures based on a two-dimensional van der Waals heterostructure device. Nat Electron 2022; 5: 752-760. [Article] [CrossRef] [MathSciNet] [Google Scholar]
  • Horowitz M. Computing’s energy problem (and what we can do about it). In: Proceedings of the 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC). IEEE, San Francisco, 2014 [Google Scholar]
  • Li S, Niu D, Malladi KT, et al. DRISA: A DRAM-based reconfigurable in-situ accelerator. In: Proceedings of the 2017 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). IEEE, Boston, 2017 [Google Scholar]
  • Chen Z, Chen X, Gu J. A 65 nm 3T dynamic analog RAM-based computing-in-memory macro and CNN accelerator with retention enhancement, adaptive analog sparsity and 44 TOPS/W system energy efficiency. In: Proceedings of the 2021 IEEE International Solid-State Circuits Conference (ISSCC). IEEE, San Francisco, 2021 [Google Scholar]
  • Shin H, Sim J, Lee D, et al. A PVT-robust customized 4T embedded DRAM cell array for accelerating binary neural networks. In: Proceedings of the 2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD). IEEE, Westminster, 2019 [Google Scholar]
  • Yu C, Yoo T, Kim H, et al. A logic-compatible eDRAM compute-in-memory with embedded ADCs for processing neural networks. IEEE Trans Circuits Syst I 2021; 68: 667-679. [Article] [CrossRef] [Google Scholar]
  • Graves A, Wayne G, Reynolds M, et al. Hybrid computing using a neural network with dynamic external memory. Nature 2016; 538: 471-476. [Article] [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  • Wang P, Xu F, Wang B, et al. Three-dimensional nand flash for vector-matrix multiplication. IEEE Trans VLSI Syst 2019; 27: 988-991. [Article] [CrossRef] [Google Scholar]
  • Wu L, Wang A, Shi J, et al. Atomically sharp interface enabled ultrahigh-speed non-volatile memory devices. Nat Nanotechnol 2021; 16: 882-887. [Article] [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  • Liu L, Liu C, Jiang L, et al. Ultrafast non-volatile flash memory based on van der Waals heterostructures. Nat Nanotechnol 2021; 16: 874-881. [Article] [NASA ADS] [CrossRef] [Google Scholar]
  • Ielmini D, Wong HSP. In-memory computing with resistive switching devices. Nat Electron 2018; 1: 333-343. [Article] [CrossRef] [Google Scholar]
  • Yao P, Wu H, Gao B, et al. Fully hardware-implemented memristor convolutional neural network. Nature 2020; 577: 641-646. [Article] [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  • Hickmott TW. Low-frequency negative resistance in thin anodic oxide films. J Appl Phys 1962; 33: 2669–2682 [CrossRef] [Google Scholar]
  • Bichler O, Suri M, Querlioz D, et al. Visual pattern extraction using energy-efficient “2-PCM synapse” neuromorphic architecture. IEEE Trans Electron Devices 2012; 59: 2206-2214. [Article] [Google Scholar]
  • Wright CD, Hosseini P, Diosdado JAV. Beyond von-neumann computing with nanoscale phase-change memory devices. Adv Funct Mater 2013; 23: 2248-2254. [Article] [Google Scholar]
  • Kim KM, Xu N, Shao X, et al. Single-cell stateful logic using a dual-bit memristor. Phys Status Solidi (RRL)-Rapid Res Lett 2019; 13: 1800629 [NASA ADS] [CrossRef] [Google Scholar]
  • Sebastian A, Le Gallo M, Khaddam-Aljameh R, et al. Memory devices and applications for in-memory computing. Nat Nanotechnol 2020; 15: 529-544. [Article] [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  • Sim J, Seol H, Kim LS. NID: Processing binary convolutional neural network in commodity DRAM. In: Proceedings of the 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD). IEEE, San Diego, 2018 [Google Scholar]
  • Saito D, Doevenspeck J, Cosemans S, et al. IGZO-based compute cell for analog in-memory computing—DTCO analysis to enable ultralow-power AI at edge. IEEE Trans Electron Devices 2020; 67: 4616-4620. [Article] [Google Scholar]
  • Cosemans S, Verhoef B, Doevenspeck J, et al. Towards 10000TOPS/W DNN inference with analog in-memory computing—A circuit blueprint, device options and requirements. In: Proceedings of the 2019 IEEE International Electron Devices Meeting (IEDM). IEEE, San Francisco, 2019 [Google Scholar]
  • Deng Q, Jiang L, Zhang YT, et al. DrAcc: a DRAM based accelerator for accurate CNN inference. In: Proceedings of the 2018 55TH ACM/ESDA/IEEE Design Automation Conference (DAC). IEEE, San Francisco, 2018 [Google Scholar]
  • Luo P, Liu C, Lin J, et al. Molybdenum disulfide transistors with enlarged van der Waals gaps at their dielectric interface via oxygen accumulation. Nat Electron 2022; 5: 849-858. [Article] [CrossRef] [Google Scholar]
  • Parrish KN, Akinwande D. Impact of contact resistance on the transconductance and linearity of graphene transistors. Appl Phys Lett 2011; 98: 183505. [Article] [NASA ADS] [CrossRef] [Google Scholar]
  • Wang Y, Tang H, Xie Y, et al. An in-memory computing architecture based on two-dimensional semiconductors for multiply-accumulate operations. Nat Commun 2021; 12: 3347. [Article] [Google Scholar]
  • Zhou Y, Wang Y, Zhuge F, et al. A Reconfigurable Two‐WSe2 ‐Transistor Synaptic Cell for Reinforcement Learning. Adv Mater 2022; 34: 2107754. [Article] [Google Scholar]
  • Yu WJ, Liu Y, Zhou H, et al. Highly efficient gate-tunable photocurrent generation in vertical heterostructures of layered materials. Nat Nanotech 2013; 8: 952-958. [Article] [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  • Zhang S, Xu H, Liao F, et al. Wafer-scale transferred multilayer MoS2 for high performance field effect transistors. Nanotechnology 2019; 30: 174002. [Article] [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  • Buscema M. Back propagation neural networks. Substance Use Misuse 1998; 33: 233–270 [CrossRef] [PubMed] [Google Scholar]

All Figures

thumbnail Figure 1

Structure of two-transistor-one-capacitor (2T1C) dynamic random-access memory (DRAM) cell and electrical characteristics of MoS2 and graphene transistors. (A) Schematic of our 2T1C DRAM cell fabrication process and optical microscope image of the fabricated device; scale bar: 100 μm. (B) Circuit diagram of 2T1C DRAM containing storage and calculation modules. (C) Transfer curve of the MoS2 field-effect transistor (FET). (D) Transfer and (E) output curves of the graphene FET.

In the text
thumbnail Figure 2

Dynamic programming and erasing operation. Input signals of Vre and Vw in (A) programming and (B) erasing operations. Retention characteristics (C) Vc and (D) Id over 9 s after programming/erasing operations. (E) Dynamic writing and reading by sequential programming and erasing pulses with 9 s interval.

In the text
thumbnail Figure 3

Characterization of the 2T1C unit cell. (A) Incremental step programming pulse (ISPP) waveform of Vw, varying from 0.5 to 2 V. Input pulse waveform of Vre. (B) Drain current Id of T2 with Vw varying from 0.5 to 2 V, where Vx = 0.1 V. (C) ISPP waveform applied to Vx, ranging from 0.1 to 0.4 V with 0.1 V increments. The pulse width was 0.1 s. (D) Corresponding Id curves when Vw was fixed at a series of values.

In the text
thumbnail Figure 4

Implementation of multiplication operation. (A) Circuit diagram of current differential designed for the 2T1C DRAM. (B) Simulated Iout values as a function of the calculated Vw × Vx values. (C) Colormap of standard multiplication operation. (D) Colormap of Iout as a function of Vw and Vx.

In the text
thumbnail Figure 5

Fully connected neural network (FNN) for letter recognition. (A) Schematic diagram of FNN for letter recognition. The 5 × 5 pixel image of the letters “F,” “D,” and “U.” The grayscale values of the pixels were the inputs of the FNN. (B) Circuit diagram of an m × n array composed of 2T1C DRAM cells. (C) Recognition rate as a function of the training epochs. The inset shows the distribution of the 25 × 9 weights after 20 training epochs. (D) Colormaps of weights before and after training. These five-level weights represent the weight voltages Vw written to the 2T1C DRAM cells.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.