The theoretical analysis demonstrates that EDIS achieves lower suboptimality than either using only online data or directly reusing offline data. EDIS is a plug-in method and can be combined with existing methods in the offline-to-online RL setting. By applying EDIS to the off-the-shelf methods Cal-QL and IQL, we