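Besides Andrew Ng’s lecture, a good discussion of the word2vec skip-gram model can be found in this two-part article (part 1 covers the intuition, part 2 an implementation in TensorFlow):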
https://towardsdatascience.com/word2vec-skip-gram-model-part-1-intuition-78614e4d6e0b
https://towardsdatascience.com/word2vec-skip-gram-model-part-2-implementation-in-tf-7efdf6f58a27