The prediction of future air quality and its responses to emission control strategies at national and state levels requires a reliable model that can replicate atmospheric observations. In this work, the Mesoscale Model (MM5) and the Community Multiscale Air Quality Modeling (CMAQ) system are applied at a 4-km horizontal grid resolution for four one-month periods, i.e., January, June, July, and August in 2002 to evaluate model performance and compare with that at 12-km. The evaluation shows skills of MM5/CMAQ that are overall consistent with current model performance. The large cold bias in temperature at 1.5 m is likely due to too cold soil initial temperatures and inappropriate snow treatments. The large overprediction in precipitation in July is due likely to too frequent afternoon convective rainfall and/or an overestimation in the rainfall intensity. The normalized mean biases and errors are -1.6% to 9.1% and 15.3-18.5% in January and -18.7% to -5.7% and 13.9-20.6% in July for max 1-h and 8-h O-3 mixing ratios, respectively, and those for 24-h average PM2.5 concentrations are 8.3-25.9% and 27.6-38.5% in January and -57.8% to -45.4% and 46.1-59.3% in July. The large underprediction in PM2.5 in summer is attributed mainly to overpredicted precipitation, inaccurate emissions, incomplete treatments for secondary organic aerosols, and model difficulties in resolving complex meteorology and geography. While O-3 prediction shows relatively less sensitivity to horizontal grid resolutions, PM2.5 and its secondary components, visibility indices, and dry and wet deposition show a moderate to high sensitivity. These results have important implications for the regulatory applications of MM5/CMAQ for future air quality attainment. (C) 2010 Elsevier Ltd. All rights reserved.