While it is not known for sure what is happening, this is how current models of memory recall would explain it:
Memory recall in the brain is not like retrieving a file from disk on a computer. In the brain, memories are reconstructed rather than retrieved. The brain is constantly augmenting what is in "working memory" with related information from the past. This is why stream of consciousness and memory recall often work by free association: The information association process is already there and we just make use of it.
When attempting to recall something specific, like a name, we "trick" the name into appearing in working memory by thinking about concepts related to it: the person's identity, when we saw them last, what they look like. Normally this process automatically brings the information into working memory as a side-effect of filling in related facts.
When a word is missing but you "think you know it," what is probably happening is that a lot of information about that word has been reconstructed in working memory, but not enough to trigger the production of the word itself. The presence of related information signals that you've "almost recalled it," but the failure to produce the word shows that the recall is incomplete.
Often when people can't recall a word, someone else can fill it in for them. But sometimes the "tip of the tongue" word does not actually exist. Related words may come to mind and it may seem like there "should be a word" for whatever it is. Thus the tip of the tongue feeling is not infallible.