On Android phones, you can either convert your model to tflite if it is a tensorflow model, or convert to Qualcomm's SNPE SDK DLC model if it is tensorflow or Onyx based, and then run it on the phone.
Both tflite and DLC models can be run on the CPU, or GPU and DSP if your android phone is a qualcomm Snapdragon chipset. A large model will need to get optimized and/or quantized before it is executable on the phone.
​
As for courses, please checkout this Coursera course on deploying models on devices.