Network on Chip (NoC) is an evolving platform for communications related applications, which are executed on a single silicon chip. There are several routing models in NoC architectures, but the accuracy of these models is limited, and the existing models are degraded because of over and under fitting issues. This research introduces the new deep learning-based latency aware predictive routing model for on-chip networks to route packets with better performance and power efficiency. The deep learning model used in this research is a new convolutional residual gated recurrent unit (CRGRU) with queuing theory. Moreover, the source and channel queuing delay is comprised of features to learn spatial and sequential information that improves the overall prediction accuracy. This router is modified by the intrusion of the Router States Monitor unit and the CRGRU hardware engine. The work is executed using the Xilinx platform, and the performance measures like latency and throughput are obtained by varying the network size as 4x4$$ 4\times 4 $$, 8x8$$ 8\times 8 $$, and 12x12$$ 12\times 12 $$ and also varying the buffer space and length as L=4,B=9$$ L=4,B=9 $$, L=9,B=4$$ L=9,B=4 $$, and L=14,B=3$$ L=14,B=3 $$, respectively. In addition, the squared correlation coefficient (SCC) and normalized root mean square error (NRMSE) are evaluated and compared with existing learning models to validate the proposed model. This research focused on the design of convolutional residual gated recurrent unit (CRGRU) with queuing theory model for routing the data packets with better performance and power efficiency.The source and channel queuing delay are comprised as features to learn spatial and sequential information that improves the overall prediction accuracy.Moreover, wormhole router and micro-architecture are utilized and modified by the intrusion of the router states monitor unit and CRGRU engine model.image