In data-driven decision-making, knowledge transfer can help address data scarcity in new ventures.The authors propose a framework of Transferred Fitted $Q$-Iteration algorithm for knowledge transfer.The framework enables direct estimation of the optimal action-state function using both target and source data.The approach shows improved learning error rates compared to single task learning, both theoretically and empirically.