Speaker diarization, speech recognition