Live Streaming Technology (From Server to Client), Part 2
Playback
In the previous article, we walked through the environment setup for live streaming (the nginx server, nginx-rtmp-module, ffmpeg, and the Android and iOS builds). Starting with this article, we turn to playback. Playback is a key step in live streaming, and it involves many techniques: decoding, scaling, timebase selection, buffering queues, video rendering, audio playback, and so on. I will walk through the whole playback flow in three parts for each platform:
- Android: The first part is video rendering based on NativeWindow, mainly using OpenGL ES 2 with a Surface passed in from the Java layer, rendering the video data onto that Surface for display. The second part is audio playback based on OpenSL ES. The third part is audio/video synchronization. Everything uses Android's own native libraries for the audio and video rendering.
- iOS: iOS likewise splits into three parts. The first is video rendering: using OpenGLES.framework to render the video frames through OpenGL. The second is audio playback, built on AudioToolbox.framework. The third is audio/video synchronization.
Using the platform's native libraries reduces resource usage, lowers memory consumption, and improves performance. Developers who are not fluent in both Android and iOS will generally pick a single cross-platform library for video display and audio playback (SDL), which can handle both. But every extra library means wasted resources and lower performance.
Android
We start with playback on the Android side, split into three parts: 1. video rendering; 2. audio playback; 3. the timebase (audio/video synchronization).
1. Video Rendering
FFmpeg provides us with a rich set of codecs (all of FFmpeg's encoding and decoding here is software codec work, not hardware; a later article will cover FFmpeg in detail). On the video side it handles FLV, MPEG, MOV, and more; on the audio side, AAC, MP3, and others. For playback as a whole, the main FFmpeg processing flow is:
<code class="language-C++ hljs scss has-numbering"> <span class="hljs-function">av_register_all()</span>; <span class="hljs-comment">// 注冊所有的文件格式和編解碼器的庫,打開的合適格式的文件上會自動選擇相應的編解碼庫</span><span class="hljs-function">avformat_network_init()</span>; <span class="hljs-comment">// 注冊網絡服務</span><span class="hljs-function">avformat_alloc_context()</span>; <span class="hljs-comment">// 分配FormatContext內存,</span><span class="hljs-function">avformat_open_input()</span>; <span class="hljs-comment">// 打開輸入流,獲取頭部信息,配合av_close_input_file()關閉流</span><span class="hljs-function">avformat_find_stream_info()</span>; <span class="hljs-comment">// 讀取packets,來獲取流信息,并在pFormatCtx->streams 填充上正確的信息</span><span class="hljs-function">avcodec_find_decoder()</span>; <span class="hljs-comment">// 獲取解碼器,</span><span class="hljs-function">avcodec_open2()</span>; <span class="hljs-comment">// 通過AVCodec來初始化AVCodecContext</span><span class="hljs-function">av_read_frame()</span>; <span class="hljs-comment">// 讀取每一幀</span><span class="hljs-function">avcodec_decode_video2()</span>; <span class="hljs-comment">// 解碼幀數據</span><span class="hljs-function">avcodec_close()</span>; <span class="hljs-comment">// 關閉編輯器上下文</span><span class="hljs-function">avformat_close_input()</span>; <span class="hljs-comment">// 關閉文件流</span></code>我們先來看一段代碼:
<code class="language-C++ hljs php has-numbering">av_register_all(); avformat_network_init(); pFormatCtx = avformat_alloc_context(); <span class="hljs-keyword">if</span> (avformat_open_input(&pFormatCtx, pathStr, <span class="hljs-keyword">NULL</span>, <span class="hljs-keyword">NULL</span>) != <span class="hljs-number">0</span>) {LOGE(<span class="hljs-string">"Couldn't open file: %s\n"</span>, pathStr);<span class="hljs-keyword">return</span>; }<span class="hljs-keyword">if</span> (avformat_find_stream_info(pFormatCtx, &dictionary) < <span class="hljs-number">0</span>) {LOGE(<span class="hljs-string">"Couldn't find stream information."</span>);<span class="hljs-keyword">return</span>; } av_dump_format(pFormatCtx, <span class="hljs-number">0</span>, pathStr, <span class="hljs-number">0</span>); </code>這段代碼可以算是初始化FFmpeg,首先注冊編解碼庫,為FormatContext分配內存,調用avformat_open_input打開輸入流,獲取頭部信息,配合avformat_find_stream_info來填充FormatContext中相關內容,av_dump_format這個是dump出流信息。這個信息是這個樣子的:
<code class="language-text hljs lasso has-numbering">video infomation: Input <span class="hljs-variable">#0</span>, flv, from <span class="hljs-string">'rtmp:127.0.0.1:1935/live/steam'</span>:Metadata:Server : NGINX RTMP (github<span class="hljs-built_in">.</span>com/sergey<span class="hljs-attribute">-dryabzhinsky</span>/nginx<span class="hljs-attribute">-rtmp</span><span class="hljs-attribute">-module</span>)displayWidth : <span class="hljs-number">320</span>displayHeight : <span class="hljs-number">240</span>fps : <span class="hljs-number">15</span>profile : level : <span class="hljs-built_in">Duration</span>: <span class="hljs-number">00</span>:<span class="hljs-number">00</span>:<span class="hljs-number">00.00</span>, start: <span class="hljs-number">15.400000</span>, bitrate: N/AStream <span class="hljs-variable">#0</span>:<span class="hljs-number">0</span>: Video: flv1 (flv), yuv420p, <span class="hljs-number">320</span>x240, <span class="hljs-number">15</span> tbr, <span class="hljs-number">1</span>k tbn, <span class="hljs-number">1</span>k tbcStream <span class="hljs-variable">#0</span>:<span class="hljs-number">1</span>: Audio: mp3, <span class="hljs-number">11025</span> Hz, stereo, s16p, <span class="hljs-number">32</span> kb/s</code>整個音頻播放流暢其實看起來也是很簡單的,主要分:1、創建實現播放引擎;2、創建實現混音器;3、設置緩沖和pcm格式;4、創建實現播放器;5、獲取音頻播放器接口;6、獲取緩沖buffer;7、注冊播放回調;8、獲取音效接口;9、獲取音量接口;10、獲取播放狀態接口;
Once these ten steps are done, the whole audio playback engine is set up; from then on the engine just reads data and plays it.
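For reference, here is a condensed sketch of those ten steps, following the standard OpenSL ES pattern from the Android NDK samples rather than this article's exact code; the variable names and the 44.1 kHz 16-bit stereo PCM format are assumptions.
```c
#include <SLES/OpenSLES.h>
#include <SLES/OpenSLES_Android.h>

// Buffer-queue callback, defined further down in the article.
void videoPlayCallBack(SLAndroidSimpleBufferQueueItf bq, void *context);

static SLObjectItf engineObject, outputMixObject, bqPlayerObject;
static SLEngineItf engineEngine;
static SLPlayItf bqPlayerPlay;
static SLAndroidSimpleBufferQueueItf bqPlayerBufferQueue;
static SLEffectSendItf bqPlayerEffectSend;
static SLVolumeItf bqPlayerVolume;

static void createAudioEngine(void) {
    // 1-2: create and realize the engine and the output mix
    slCreateEngine(&engineObject, 0, NULL, 0, NULL, NULL);
    (*engineObject)->Realize(engineObject, SL_BOOLEAN_FALSE);
    (*engineObject)->GetInterface(engineObject, SL_IID_ENGINE, &engineEngine);
    (*engineEngine)->CreateOutputMix(engineEngine, &outputMixObject, 0, NULL, NULL);
    (*outputMixObject)->Realize(outputMixObject, SL_BOOLEAN_FALSE);

    // 3: buffer queue locator and PCM format (44.1 kHz, 16-bit, stereo assumed)
    SLDataLocator_AndroidSimpleBufferQueue loc_bufq =
        {SL_DATALOCATOR_ANDROIDSIMPLEBUFFERQUEUE, 2};
    SLDataFormat_PCM format_pcm = {SL_DATAFORMAT_PCM, 2, SL_SAMPLINGRATE_44_1,
        SL_PCMSAMPLEFORMAT_FIXED_16, SL_PCMSAMPLEFORMAT_FIXED_16,
        SL_SPEAKER_FRONT_LEFT | SL_SPEAKER_FRONT_RIGHT, SL_BYTEORDER_LITTLEENDIAN};
    SLDataSource audioSrc = {&loc_bufq, &format_pcm};
    SLDataLocator_OutputMix loc_outmix = {SL_DATALOCATOR_OUTPUTMIX, outputMixObject};
    SLDataSink audioSnk = {&loc_outmix, NULL};

    // 4: create and realize the player
    const SLInterfaceID ids[3] = {SL_IID_BUFFERQUEUE, SL_IID_EFFECTSEND, SL_IID_VOLUME};
    const SLboolean req[3] = {SL_BOOLEAN_TRUE, SL_BOOLEAN_TRUE, SL_BOOLEAN_TRUE};
    (*engineEngine)->CreateAudioPlayer(engineEngine, &bqPlayerObject,
                                       &audioSrc, &audioSnk, 3, ids, req);
    (*bqPlayerObject)->Realize(bqPlayerObject, SL_BOOLEAN_FALSE);

    // 5-7: play interface, buffer queue, playback callback
    (*bqPlayerObject)->GetInterface(bqPlayerObject, SL_IID_PLAY, &bqPlayerPlay);
    (*bqPlayerObject)->GetInterface(bqPlayerObject, SL_IID_BUFFERQUEUE, &bqPlayerBufferQueue);
    (*bqPlayerBufferQueue)->RegisterCallback(bqPlayerBufferQueue, videoPlayCallBack, NULL);

    // 8-10: effects, volume, and play state
    (*bqPlayerObject)->GetInterface(bqPlayerObject, SL_IID_EFFECTSEND, &bqPlayerEffectSend);
    (*bqPlayerObject)->GetInterface(bqPlayerObject, SL_IID_VOLUME, &bqPlayerVolume);
    (*bqPlayerPlay)->SetPlayState(bqPlayerPlay, SL_PLAYSTATE_PLAYING);
}
```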
Playback then works by pushing data into bqPlayerBufferQueue for the playback engine to read and play. Recall that when we created the buffer queue we registered a callback; its job is to notify us that more data can be added to the queue. The callback looks like this:
<code class="hljs lasso has-numbering"><span class="hljs-literal">void</span> videoPlayCallBack(SLAndroidSimpleBufferQueueItf bq, <span class="hljs-literal">void</span> <span class="hljs-subst">*</span>context) {<span class="hljs-comment">// 添加數據到bqPlayerBufferQueue中,通過調用playBuffer方法。</span><span class="hljs-literal">void</span><span class="hljs-subst">*</span> <span class="hljs-built_in">data</span> <span class="hljs-subst">=</span> getData();int size <span class="hljs-subst">=</span> getDataSize();playBuffer(<span class="hljs-built_in">data</span>, size); }</code><code class="hljs cpp has-numbering"><span class="hljs-keyword">typedef</span> <span class="hljs-keyword">struct</span> PlayInstance {ANativeWindow *window; <span class="hljs-comment">// nativeWindow // 通過傳入surface構建</span><span class="hljs-keyword">int</span> display_width; <span class="hljs-comment">// 顯示寬度</span><span class="hljs-keyword">int</span> display_height; <span class="hljs-comment">// 顯示高度</span><span class="hljs-keyword">int</span> stop; <span class="hljs-comment">// 停止</span><span class="hljs-keyword">int</span> timeout_flag; <span class="hljs-comment">// 超時標記</span><span class="hljs-keyword">int</span> disable_video; VideoState *videoState; <span class="hljs-comment">//隊列</span><span class="hljs-keyword">struct</span> ThreadQueue *<span class="hljs-built_in">queue</span>; <span class="hljs-comment">// 音視頻幀隊列</span><span class="hljs-keyword">struct</span> ThreadQueue *video_queue; <span class="hljs-comment">// 視頻幀隊列</span><span class="hljs-keyword">struct</span> ThreadQueue *audio_queue; <span class="hljs-comment">// 音頻幀隊列</span>} PlayInstance;</code>
3. Timebase (Audio/Video Synchronization)
Let's focus on the delay-synchronization code:
<code class="hljs autohotkey has-numbering">// 延時同步int64_t pkt_pts = pavpacket.pts<span class="hljs-comment">;</span>double show_time = pkt_pts * (playInstance->videoState->video_time_base)<span class="hljs-comment">;</span>int64_t show_time_micro = show_time * <span class="hljs-number">1000000</span><span class="hljs-comment">;</span>int64_t played_time = av_gettime() - playInstance->videoState->video_start_time<span class="hljs-comment">;</span>int64_t delt<span class="hljs-built_in">a_time</span> = show_time_micro - played_time<span class="hljs-comment">;</span><span class="hljs-keyword">if</span> (delt<span class="hljs-built_in">a_time</span> < -(<span class="hljs-number">0.2</span> * <span class="hljs-number">1000000</span>)) {LOGE(<span class="hljs-string">"視頻跳幀\n"</span>)<span class="hljs-comment">;</span><span class="hljs-keyword">continue</span>;} <span class="hljs-keyword">else</span> <span class="hljs-keyword">if</span> (delt<span class="hljs-built_in">a_time</span> > <span class="hljs-number">0.2</span> * <span class="hljs-number">1000000</span>) {av_usleep(delt<span class="hljs-built_in">a_time</span>)<span class="hljs-comment">;</span>}</code>這是一段Swift代碼。在ios采用的是swift+oc+c++混合編譯,正好借此熟悉swift于oc和c++的交互。enableAudio主要是創建一個audioManager實例,進行注冊回調,和開始播放和暫停服務。audioManager是一個單例。是一個封裝AudioToolbox類。下面的代碼是激活AudioSession(初始化Audio)和失效AudioSession代碼。
<code class="language-oc hljs objectivec has-numbering">- (<span class="hljs-built_in">BOOL</span>) activateAudioSession {<span class="hljs-keyword">if</span> (!_activated) {<span class="hljs-keyword">if</span> (!_initialized) {<span class="hljs-keyword">if</span> (checkError(AudioSessionInitialize(<span class="hljs-literal">NULL</span>,kCFRunLoopDefaultMode,sessionInterruptionListener,(__bridge <span class="hljs-keyword">void</span> *)(<span class="hljs-keyword">self</span>)),<span class="hljs-string">"Couldn't initialize audio session"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;_initialized = <span class="hljs-literal">YES</span>;}<span class="hljs-keyword">if</span> ([<span class="hljs-keyword">self</span> checkAudioRoute] &&[<span class="hljs-keyword">self</span> setupAudio]) {_activated = <span class="hljs-literal">YES</span>;}}<span class="hljs-keyword">return</span> _activated; }- (<span class="hljs-keyword">void</span>) deactivateAudioSession {<span class="hljs-keyword">if</span> (_activated) {[<span class="hljs-keyword">self</span> pause];checkError(AudioUnitUninitialize(_audioUnit),<span class="hljs-string">"Couldn't uninitialize the audio unit"</span>);<span class="hljs-comment">/*fails with error (-10851) ? checkError(AudioUnitSetProperty(_audioUnit,kAudioUnitProperty_SetRenderCallback,kAudioUnitScope_Input,0,NULL,0),"Couldn't clear the render callback on the audio unit");*/</span>checkError(AudioComponentInstanceDispose(_audioUnit),<span class="hljs-string">"Couldn't dispose the output audio unit"</span>);checkError(AudioSessionSetActive(<span class="hljs-literal">NO</span>),<span class="hljs-string">"Couldn't deactivate the audio session"</span>); checkError(AudioSessionRemovePropertyListenerWithUserData(kAudioSessionProperty_AudioRouteChange,sessionPropertyListener,(__bridge <span class="hljs-keyword">void</span> *)(<span class="hljs-keyword">self</span>)),<span class="hljs-string">"Couldn't remove audio session property listener"</span>);checkError(AudioSessionRemovePropertyListenerWithUserData(kAudioSessionProperty_CurrentHardwareOutputVolume,sessionPropertyListener,(__bridge <span class="hljs-keyword">void</span> *)(<span class="hljs-keyword">self</span>)),<span class="hljs-string">"Couldn't remove audio session property listener"</span>);_activated = <span class="hljs-literal">NO</span>;} }- (<span class="hljs-built_in">BOOL</span>) setupAudio {<span class="hljs-comment">// --- Audio Session Setup ---</span>UInt32 sessionCategory = kAudioSessionCategory_MediaPlayback;<span class="hljs-comment">//UInt32 sessionCategory = kAudioSessionCategory_PlayAndRecord;</span><span class="hljs-keyword">if</span> (checkError(AudioSessionSetProperty(kAudioSessionProperty_AudioCategory,<span class="hljs-keyword">sizeof</span>(sessionCategory),&sessionCategory),<span class="hljs-string">"Couldn't set audio category"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;<span class="hljs-keyword">if</span> (checkError(AudioSessionAddPropertyListener(kAudioSessionProperty_AudioRouteChange,sessionPropertyListener,(__bridge <span class="hljs-keyword">void</span> *)(<span class="hljs-keyword">self</span>)),<span class="hljs-string">"Couldn't add audio session property listener"</span>)){<span class="hljs-comment">// just warning</span>}<span class="hljs-keyword">if</span> (checkError(AudioSessionAddPropertyListener(kAudioSessionProperty_CurrentHardwareOutputVolume,sessionPropertyListener,(__bridge <span 
class="hljs-keyword">void</span> *)(<span class="hljs-keyword">self</span>)),<span class="hljs-string">"Couldn't add audio session property listener"</span>)){<span class="hljs-comment">// just warning</span>}<span class="hljs-comment">// Set the buffer size, this will affect the number of samples that get rendered every time the audio callback is fired</span><span class="hljs-comment">// A small number will get you lower latency audio, but will make your processor work harder</span><span class="hljs-preprocessor">#if !TARGET_IPHONE_SIMULATOR</span>Float32 preferredBufferSize = <span class="hljs-number">0.0232</span>;<span class="hljs-keyword">if</span> (checkError(AudioSessionSetProperty(kAudioSessionProperty_PreferredHardwareIOBufferDuration,<span class="hljs-keyword">sizeof</span>(preferredBufferSize),&preferredBufferSize),<span class="hljs-string">"Couldn't set the preferred buffer duration"</span>)) {<span class="hljs-comment">// just warning</span>} <span class="hljs-preprocessor">#endif</span><span class="hljs-keyword">if</span> (checkError(AudioSessionSetActive(<span class="hljs-literal">YES</span>),<span class="hljs-string">"Couldn't activate the audio session"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;[<span class="hljs-keyword">self</span> checkSessionProperties];<span class="hljs-comment">// ----- Audio Unit Setup -----</span><span class="hljs-comment">// Describe the output unit.</span>AudioComponentDescription description = {<span class="hljs-number">0</span>};description<span class="hljs-variable">.componentType</span> = kAudioUnitType_Output;description<span class="hljs-variable">.componentSubType</span> = kAudioUnitSubType_RemoteIO;description<span class="hljs-variable">.componentManufacturer</span> = kAudioUnitManufacturer_Apple;<span class="hljs-comment">// Get component</span>AudioComponent component = AudioComponentFindNext(<span class="hljs-literal">NULL</span>, &description);<span class="hljs-keyword">if</span> (checkError(AudioComponentInstanceNew(component, &_audioUnit),<span class="hljs-string">"Couldn't create the output audio unit"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;UInt32 size;<span class="hljs-comment">// Check the output stream format</span>size = <span class="hljs-keyword">sizeof</span>(AudioStreamBasicDescription);<span class="hljs-keyword">if</span> (checkError(AudioUnitGetProperty(_audioUnit,kAudioUnitProperty_StreamFormat,kAudioUnitScope_Input,<span class="hljs-number">0</span>,&_outputFormat,&size),<span class="hljs-string">"Couldn't get the hardware output stream format"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;_outputFormat<span class="hljs-variable">.mSampleRate</span> = _samplingRate;<span class="hljs-keyword">if</span> (checkError(AudioUnitSetProperty(_audioUnit,kAudioUnitProperty_StreamFormat,kAudioUnitScope_Input,<span class="hljs-number">0</span>,&_outputFormat,size),<span class="hljs-string">"Couldn't set the hardware output stream format"</span>)) {<span class="hljs-comment">// just warning</span>}_numBytesPerSample = _outputFormat<span class="hljs-variable">.mBitsPerChannel</span> / <span class="hljs-number">8</span>;_numOutputChannels = _outputFormat<span class="hljs-variable">.mChannelsPerFrame</span>;LoggerAudio(<span class="hljs-number">2</span>, @<span class="hljs-string">"Current output bytes per sample: %ld"</span>, _numBytesPerSample);LoggerAudio(<span class="hljs-number">2</span>, @<span 
class="hljs-string">"Current output num channels: %ld"</span>, _numOutputChannels);<span class="hljs-comment">// Slap a render callback on the unit</span>AURenderCallbackStruct callbackStruct;callbackStruct<span class="hljs-variable">.inputProc</span> = renderCallback; <span class="hljs-comment">// 注冊回調,這個回調是用來取數據的,也就是</span>callbackStruct<span class="hljs-variable">.inputProcRefCon</span> = (__bridge <span class="hljs-keyword">void</span> *)(<span class="hljs-keyword">self</span>);<span class="hljs-keyword">if</span> (checkError(AudioUnitSetProperty(_audioUnit,kAudioUnitProperty_SetRenderCallback,kAudioUnitScope_Input,<span class="hljs-number">0</span>,&callbackStruct,<span class="hljs-keyword">sizeof</span>(callbackStruct)),<span class="hljs-string">"Couldn't set the render callback on the audio unit"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;<span class="hljs-keyword">if</span> (checkError(AudioUnitInitialize(_audioUnit),<span class="hljs-string">"Couldn't initialize the audio unit"</span>))<span class="hljs-keyword">return</span> <span class="hljs-literal">NO</span>;<span class="hljs-keyword">return</span> <span class="hljs-literal">YES</span>; }</code>總結
Summary
This article walked through the playback logic built on FFmpeg, split across Android and iOS with platform-specific handling on each side. On Android, video is rendered through NativeWindow (a Surface) and audio is played through OpenSL ES; audio/video synchronization is based on an external timebase against which both audio and video are adjusted. On iOS, video is rendered with OpenGL and audio is played with AudioToolbox; synchronization works the same way as on Android. The FFmpeg logic is identical on both platforms. How OpenGL is used for video rendering on iOS was not covered in depth here; interested readers can dig into it on their own.
Note: the complete project code will be published once it has been cleaned up.
最后添加兩張播放效果圖