+

CN110266984B - A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine - Google Patents

A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine Download PDF

Info

Publication number
CN110266984B
CN110266984B CN201910588415.8A CN201910588415A CN110266984B CN 110266984 B CN110266984 B CN 110266984B CN 201910588415 A CN201910588415 A CN 201910588415A CN 110266984 B CN110266984 B CN 110266984B
Authority
CN
China
Prior art keywords
camera
video
module
teacher
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201910588415.8A
Other languages
Chinese (zh)
Other versions
CN110266984A (en
Inventor
王宣银
杜双龙
莫奇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201910588415.8A priority Critical patent/CN110266984B/en
Publication of CN110266984A publication Critical patent/CN110266984A/en
Application granted granted Critical
Publication of CN110266984B publication Critical patent/CN110266984B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47202End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

本发明公开了一种云台摄像智能分析教学录播一体机,包括云台和搭载在云台上的摄像头单元;摄像头单元包括:摄像头,用于采集教学场景的音视频信号;主机处理模块,用于接收音视频信号,对音视频信号进行编码,封装,封装后的音视频一方面储存于本地,另一方面通过4G网络模块将音视频以流媒体形式推出;视频分析DSP模块,用于接收视频信号,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;本发明将摄像头与云台一体化,对课堂场景下特定目标行为的识别与分析,结合4G网络的实时传输,有助于进一步提升教学课堂的智能化程度。

Figure 201910588415

The invention discloses a PTZ camera intelligent analysis teaching recording and broadcasting integrated machine, comprising a PTZ and a camera unit mounted on the PTZ; the camera unit comprises: a camera for collecting audio and video signals of a teaching scene; a host processing module, It is used to receive audio and video signals, encode and encapsulate the audio and video signals. On the one hand, the packaged audio and video are stored locally, and on the other hand, the audio and video are released in the form of streaming media through the 4G network module; the video analysis DSP module is used for Receive the video signal, classify and identify the target in the video signal, output the motion control signal to the pan-tilt control module according to the result of the classification and identification, and output the camera focus signal to the camera control module, so as to track the target in real time; The integration with the PTZ, the identification and analysis of specific target behaviors in classroom scenarios, combined with the real-time transmission of 4G networks, will help to further improve the intelligence of the teaching classroom.

Figure 201910588415

Description

一种云台摄像智能分析教学录播一体机A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

技术领域technical field

本发明涉及智慧教学领域,尤其是一种云台摄像智能分析教学录播一体机。The invention relates to the field of intelligent teaching, in particular to a pan-tilt camera intelligent analysis and teaching recording and broadcasting integrated machine.

技术背景technical background

随着计算机视觉与网络流媒体技术的不断发展,加之云计算、移动互联网以及大数据分析等技术的不断成熟,其与现代教育的深度融合,催生了新一代智能教学录播系统。With the continuous development of computer vision and network streaming media technology, coupled with the continuous maturity of cloud computing, mobile Internet and big data analysis technologies, its deep integration with modern education has spawned a new generation of intelligent teaching recording and broadcasting systems.

智能录播系统一般采用摄像头与录播主机分离的方式,即前端采用独立的摄像机获取视频数据,并通过SDI与HDMI等方式将视频输入到录播主机内,录播主机处理视频数据后,一方面存储视频,另一方面通过WiFi或者以太网进行网络直播,该种模式下的直播大都仅限于局域网直播。The intelligent recording and broadcasting system generally adopts the separation method of the camera and the recording and broadcasting host, that is, the front end uses an independent camera to obtain video data, and inputs the video into the recording and broadcasting host through SDI and HDMI. After the recording and broadcasting host processes the video data, a On the one hand, the video is stored, and on the other hand, the network is broadcast live through WiFi or Ethernet. Most of the live broadcasts in this mode are limited to local area network live broadcasts.

摄像头与录播主机分离,导致其占用空间大,结构不够紧凑;基于WiFi或者以太网的网络直播模式,导致其适用场景受到极大的限制;以往录播主机只负责录制与直播,并没有集成相应的智能识别算法,在后期处理时还需花费额外的时间。The camera is separated from the recording and broadcasting host, which results in a large space occupation and an insufficiently compact structure; the network live broadcast mode based on WiFi or Ethernet has greatly limited its applicable scenarios; in the past, the recording and broadcasting host was only responsible for recording and live broadcasting, and did not integrate Corresponding intelligent recognition algorithms will take extra time in post-processing.

发明内容SUMMARY OF THE INVENTION

鉴于以上问题,本发明的目的是提供了一种云台摄像智能分析教学录播一体机,采用了一体化的解决方案,可实现在广域网内的课堂实时直播以及课堂点播,与此同时还能实现课堂场景下目标与行为的自动分析并完成教师与学生的运动实时跟踪。In view of the above problems, the purpose of the present invention is to provide a PTZ camera intelligent analysis teaching recording and broadcasting integrated machine, which adopts an integrated solution, which can realize the real-time live broadcast of the classroom and the classroom on-demand in the wide area network, and at the same time can also Realize automatic analysis of goals and behaviors in classroom scenarios and complete real-time tracking of teachers and students' movements.

本发明的具体技术方案如下:The concrete technical scheme of the present invention is as follows:

一种云台摄像智能分析教学录播一体机,包括云台和搭载在云台上的摄像头单元;A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine, comprising a PTZ and a camera unit mounted on the PTZ;

所述摄像头单元包括摄像头、主机处理模块4G网络模块、视频分析DSP模块、摄像头控制模块、云台控制模块;The camera unit includes a camera, a host processing module 4G network module, a video analysis DSP module, a camera control module, and a PTZ control module;

所述摄像头,用于采集教学场景的视频信号和音频信号;The camera is used to collect video signals and audio signals of the teaching scene;

所述主机处理模块,用于接收视频信号和音频信号,对视频和音频进行编码,封装,封装后的音视频一方面储存于本地,另一方面通过4G网络模块将音视频以流媒体形式推出;The host processing module is used to receive video signals and audio signals, encode the video and audio, and encapsulate the encapsulated audio and video. ;

所述视频分析DSP模块,用于接收视频信号,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;The video analysis DSP module is used to receive the video signal, classify and identify the target in the video signal, and output the motion control signal to the pan-tilt control module according to the result of the classification and identification, and output the camera focus signal to the camera control module, so as to Targets are tracked in real time;

摄像头控制模块,用于控制摄像头的对焦;The camera control module is used to control the focus of the camera;

云台控制模块,用于控制云台的运动。The PTZ control module is used to control the movement of the PTZ.

进一步的,在视频分析DSP模块中,对目标的分类识别包括教师与学生的身份识别以及教师与学生在课堂场景下的特定行为的识别。Further, in the video analysis DSP module, the classification and recognition of the target includes the identification of the teacher and the student and the recognition of the specific behavior of the teacher and the student in the classroom scene.

进一步的,针对教师,其特定行为包括教师板书、教师提问以及教师徘徊;针对学生,其特定行为包括学生举手、起立和坐下。Further, for teachers, their specific behaviors include teacher writing on the blackboard, teachers ask questions, and teachers linger; for students, their specific behaviors include students raising their hands, standing up, and sitting down.

进一步的,对目标进行实时跟踪包括教师行为的跟踪或者学生行为的跟踪;所述教师行为的跟踪,主要包括教师徘徊时云台摄像机的跟踪运动;所述学生行为的跟踪,主要包括学生站立时摄像头的变焦镜头特写以及学生坐下时摄像头的变焦镜头释放。Further, the real-time tracking of the target includes the tracking of the teacher's behavior or the tracking of the student's behavior; the tracking of the teacher's behavior mainly includes the tracking movement of the pan-tilt camera when the teacher is wandering; the tracking of the student's behavior mainly includes when the student is standing. A close-up of the camera's zoom lens and the camera's zoom lens release while the student is seated.

进一步的,所述对视频信号中的目标进行分类识别,通过训练相应的分类器来实现,具体方法如下:Further, the classification and identification of the target in the video signal is realized by training a corresponding classifier, and the specific method is as follows:

S1:训练分类器:S1: Train the classifier:

S11:获取课堂场景下图片训练样本数据集,数据集中包括各种需要识别的动作信息,动作信息:教师板书、教师提问、教师徘徊、学生举手、学生起立、学生坐下;S11: Obtain the image training sample data set in the classroom scene. The data set includes various action information that needs to be identified. Action information: teacher writing on the blackboard, teacher asking questions, teacher wandering, student raising hand, student standing up, and student sitting down;

S12:提取样本特征;S12: Extract sample features;

S13:制作样本训练集,将样本提取后的特征与标签对应;S13: Make a sample training set, and correspond the extracted features of the samples to the labels;

S14:建立分类器模型;S14: establish a classifier model;

S15:将样本训练集进行分类器训练,终止条件为达到预定精度或者达到预定训练次数;S15: Perform classifier training on the sample training set, and the termination condition is that the predetermined precision is reached or the predetermined number of training times is reached;

S2:使用训练好的分类器进行课堂行为动作分类识别:S2: Use the trained classifier to classify and recognize classroom actions:

S21:获取实时视频,并获取运动目标区域;S21: obtain real-time video, and obtain the moving target area;

S22:对运动目标区域进行特征提取;S22: Feature extraction is performed on the moving target area;

S23:将提取的特征使用步骤S1训练得到的分类器进行分类,获得分类结果,对应的即可获取相应的行为动作类别。S23: Classify the extracted features using the classifier trained in step S1 to obtain a classification result, and correspondingly, a corresponding behavior action category can be obtained.

进一步的,所述S12中,提取样本特征使用SIFT特征、HOG特征以及LBP特征。Further, in the S12, SIFT features, HOG features and LBP features are used to extract the sample features.

进一步的,所述S14中,分类器模型采用SVM分类器,K近邻分类器或贝叶斯分类器。Further, in S14, the classifier model adopts SVM classifier, K-nearest neighbor classifier or Bayesian classifier.

进一步的,所述摄像头单元还包括本地数据传输模块,主机处理模块封装后的音视频通过本地数据传输模块直接传输到本地显示。Further, the camera unit further includes a local data transmission module, and the audio and video packaged by the host processing module is directly transmitted to the local display through the local data transmission module.

进一步的,所述4G网络模块能直接访问互联网,用户通过外网直接访问实时流媒体。Further, the 4G network module can directly access the Internet, and users can directly access real-time streaming media through the external network.

进一步的,所述的云台为悬挂式的一体化云台,其水平方向运动采用蜗轮蜗杆传动方式,俯仰方向运动采用行星轮传动方式。Further, the pan/tilt is a suspended integrated pan/tilt, and its horizontal movement adopts a worm gear transmission mode, and its pitch direction movement adopts a planetary gear transmission mode.

相对于现有技术,本发明的有益效果如下:With respect to the prior art, the beneficial effects of the present invention are as follows:

1、采用了一体化录播主机解决方案,解决了以往录播主机系统占用空间大,以及繁琐的安装与配置流程。1. The integrated recording and broadcasting host solution is adopted, which solves the large space occupied by the previous recording and broadcasting host system and the cumbersome installation and configuration process.

2、使用了4G网络进行直播流媒体传输,更加方便用户的随时随地访问。2. The 4G network is used for live streaming media transmission, which is more convenient for users to access anytime, anywhere.

3、配置了课堂场景下专用的教师与学生分析摄像头,智能追踪教师学生行为,极大地提高了教育课堂的智能化程度。3. It is equipped with a dedicated teacher and student analysis camera in the classroom scene to intelligently track the behavior of teachers and students, which greatly improves the intelligence of the education classroom.

附图说明Description of drawings

图1为一体化云台整体结构图;Figure 1 is the overall structure diagram of the integrated PTZ;

图2为一体化云台拆解示意图;Figure 2 is a schematic diagram of the dismantling of the integrated gimbal;

图3为一体化云台减速结构示意图;Figure 3 is a schematic diagram of the deceleration structure of the integrated gimbal;

图4为一体化录播像头主板的具体结构框图;Fig. 4 is the concrete structural block diagram of the main board of the integrated video recording head;

图5为一体化录播摄像头主板的结构示意图;Fig. 5 is the structural representation of the main board of the integrated recording and broadcasting camera;

图6为教学录播一体机功能流程图;Fig. 6 is the functional flow chart of the teaching recording and broadcasting machine;

其中:in:

1:摄像头单元 2:云台1: Camera unit 2: Gimbal

3:俯仰运动输出端盖 4:右侧端盖3: Tilt motion output end cap 4: Right end cap

5:右侧摆臂 6:云台三通5: Right swing arm 6: Gimbal tee

7:连接件 8:轴承7: Connector 8: Bearing

9:云台支架 10:蜗轮轴9: PTZ bracket 10: Worm gear shaft

11:云台减速箱 12:左侧摆臂11: Gimbal gear box 12: Left swing arm

13:俯仰运动动力输入 14:水平运动动力输入13: Power input for pitching motion 14: Power input for horizontal motion

15:俯仰运动动力输出 16:水平运动动力输出15: PTO for pitching motion 16: PTO for horizontal motion

具体实施方式Detailed ways

下面结合附图举例对本发明做更详细的描述:The present invention will be described in more detail below in conjunction with the accompanying drawings:

如图1所示,一种云台摄像智能分析教学录播一体机,包括云台2和搭载在云台2上的摄像头单元1。As shown in FIG. 1 , a PTZ camera intelligent analysis teaching recording and broadcasting integrated machine includes a PTZ 2 and a camera unit 1 mounted on the PTZ 2 .

根据上述方案的本发明,所述的云台2为悬挂式的一体化云台,如图2所示,其水平方向运动采用蜗轮蜗杆传动方式,蜗轮固定于蜗轮轴10上,蜗杆相对于蜗轮做旋转运动,即云台的水平转动,其中云台支架9上的蜗轮轴10与轴承8采用过盈配合,轴承8与连接件7也采用过盈配合,连接件7与云台三通6采用螺纹配合;云台俯仰方向运动采用行星轮传动方式,行星轮与俯仰运动输出端盖3采用内外齿轮啮合方式,行星小齿轮中心轴位置不变,俯仰运动输出端盖3相对其中心轴做旋转运动,驱动左侧摆臂12与右侧摆臂5摆动,从而带动摄像头单元1做俯仰运动,右侧端盖4用于辅助定位俯仰运动输出端盖3,并承载一定的载荷。According to the present invention of the above-mentioned scheme, the described pan/tilt 2 is a suspended integrated pan/tilt. As shown in FIG. 2 , its horizontal movement adopts a worm gear and worm drive mode, the worm gear is fixed on the worm gear shaft 10, and the worm gear is relative to the worm gear. Do a rotary motion, that is, the horizontal rotation of the gimbal, wherein the worm gear shaft 10 on the gimbal bracket 9 and the bearing 8 adopt an interference fit, the bearing 8 and the connecting piece 7 also adopt an interference fit, and the connecting piece 7 and the gimbal tee 6 Thread fit is adopted; the movement in the pitching direction of the gimbal adopts planetary gear transmission, the planetary gear and the pitching movement output end cover 3 are meshed with internal and external gears, the position of the central axis of the planetary pinion remains unchanged, and the pitching movement output end cover 3 is relative to its central axis. The rotation movement drives the left swing arm 12 and the right swing arm 5 to swing, thereby driving the camera unit 1 to perform a pitching motion. The right end cover 4 is used to assist in positioning the pitching motion output end cover 3 and bear a certain load.

根据上述方案的本发明,所述云台减速箱11作为一体化云台的核心部件,具体结构如图3所示,中心位置电机作为俯仰运动动力输入13,驱动行星轮系运动,行星轮机构作为俯仰运动动力输出15,带动摄像头单元1做俯仰运动;偏心位置电机作为水平运动动力输入14,带动蜗杆转动作为水平运动动力输出16,因蜗轮固定不动,最后蜗杆围绕蜗轮转动,即摄像头单元1的水平运动;蜗轮蜗杆传动与行星轮传动都有传动比大,且结构紧凑的优点,非常适合一体化云台应用场景。According to the present invention of the above scheme, the pan-tilt gear box 11 is used as the core component of the integrated pan-tilt head. The specific structure is shown in FIG. 3 . As the power output 15 for pitching motion, it drives the camera unit 1 to do pitching motion; the eccentric position motor acts as the power input 14 for horizontal motion, and drives the worm to rotate as the power output 16 for horizontal motion. 1 horizontal movement; worm gear transmission and planetary gear transmission have the advantages of large transmission ratio and compact structure, which are very suitable for the application scene of the integrated pan/tilt.

根据上述方案的本发明,所述的摄像头单元的主板具体结构框图如图4所示,该主板包含音频信号接口、视频信号接口、主机处理模块、硬盘储存接口、4G网络模块、本地数据传输模块、视频分析DSP模块、摄像头控制模块和云台控制模块;所述视频信号和音频信号分别从摄像头获取并传递给主机处理模块,主机处理模块对音频和视频进行编码,编码后的音视频由主机处理模块继续对其进行封装,封装后的视频一方面通过硬盘存储接口储存于本地,另一方面通过4G网络模块将网络流推出,另一方面可通过本地数据传输模块直接无延时、无损失的传输到本地显示;视频信号同时传递给视频分析DSP模块,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;摄像头控制模块,用于控制摄像头的对焦;云台控制模块,用于控制云台的运动。According to the present invention of the above scheme, the specific structural block diagram of the main board of the camera unit is shown in FIG. 4 , and the main board includes an audio signal interface, a video signal interface, a host processing module, a hard disk storage interface, a 4G network module, and a local data transmission module. , video analysis DSP module, camera control module and PTZ control module; the video signal and audio signal are respectively obtained from the camera and transmitted to the host processing module, the host processing module encodes the audio and video, and the encoded audio and video are processed by the host The processing module continues to encapsulate it. On the one hand, the encapsulated video is stored locally through the hard disk storage interface, on the other hand, the network stream is pushed out through the 4G network module, and on the other hand, it can be directly passed through the local data transmission module without delay and loss. The video signal is simultaneously transmitted to the video analysis DSP module to classify and identify the target in the video signal. According to the result of the classification and identification, the motion control signal is output to the PTZ control module, and the camera focus signal is output to the camera control module. , so as to track the target in real time; the camera control module is used to control the focus of the camera; the PTZ control module is used to control the movement of the PTZ.

根据上述方案的本发明,提供一种较佳实施例一体化录播摄像头主板结构示意图,如图5所示,用于视频处理功能、网络直播功能、录制存储功能、视频点播功能的主机处理模块,其主处理芯片优选海思HI3531A芯片;用于4G网络功能的4G网络模块,其芯片优选MT7620系列芯片,通过SIM卡提供数据达到4G网络传输;用于视频分析功能和运动跟踪功能的视频分析DSP模块,其芯片优选TMS320系列芯片;用于云台控制功能的云台控制模块和摄像头控制模块,其芯片优选STM32系列芯片,通过两路RS232分别控制云台和摄像头相关运动;本地数据传输模块一方面可直接通过HDMI接口向外传输数据,另一方面可通过GV7600芯片并行输出转串行输出,再通过SDI接口向外传输数据;除此之外,摄像头主板提供SATA接口和SD卡接口,用于本地视频储存。According to the present invention of the above scheme, a schematic diagram of the main board structure of the integrated recording and broadcasting camera according to the preferred embodiment is provided, as shown in FIG. , its main processing chip is preferably HiSilicon HI3531A chip; for 4G network module for 4G network function, its chip is preferably MT7620 series chip, which provides data through SIM card to achieve 4G network transmission; video analysis for video analysis function and motion tracking function DSP module, its chip is preferably TMS320 series chip; the PTZ control module and camera control module used for PTZ control function, its chip is preferably STM32 series chip, through two channels of RS232 to control the related motion of PTZ and camera respectively; local data transmission module On the one hand, data can be transmitted directly through the HDMI interface, on the other hand, it can be converted from parallel output to serial output through the GV7600 chip, and then transmitted through the SDI interface. In addition, the camera motherboard provides SATA interface and SD card interface, For local video storage.

根据上述方案的本发明,所述网络直播功能基于4G网络模块访问互联网,用户可以通过外网直接访问该录播主机的实时流媒体,录播主机采用RTMP实时流媒体协议传输音视频实时流,其基本流程如下:According to the present invention of the above scheme, the network live broadcast function accesses the Internet based on the 4G network module, and the user can directly access the real-time streaming media of the recording and broadcasting host through the external network, and the recording and broadcasting host adopts the RTMP real-time streaming media protocol to transmit audio and video real-time streams, The basic process is as follows:

S01:录播主机通过视频处理模块获取实时音视频流,并且将其编码成特定的格式,音频编码成AAC格式,视频编码成H.264格式。S01: The recording and broadcasting host obtains the real-time audio and video stream through the video processing module, and encodes it into a specific format, the audio is encoded in the AAC format, and the video is encoded in the H.264 format.

S02:将编码后的音视频数据按照FLV格式封装,使其符合RTMP传输标准。S02: Encapsulate the encoded audio and video data according to the FLV format, so that it conforms to the RTMP transmission standard.

S03:使用LibRTMP流媒体推流框架,将实时流推送到RTMP流媒体服务器。S03: Use the LibRTMP streaming media push framework to push the real-time stream to the RTMP streaming media server.

S04:RTMP流媒体播放器播放实时流媒体。S04: RTMP streaming media player plays real-time streaming media.

根据上述方案的本发明,视频分析DSP模块对视频信号中的目标进行分类识别。According to the present invention of the above scheme, the video analysis DSP module classifies and recognizes the target in the video signal.

对目标的分类识别包括教师与学生的身份识别以及教师与学生在课堂场景下的特定行为的识别。The classification and identification of the target includes the identification of teachers and students and the identification of specific behaviors of teachers and students in classroom scenarios.

针对教师,其特定行为包括教师板书、教师提问以及教师徘徊;针对学生,其特定行为包括学生举手、起立和坐下。For teachers, their specific behaviors include teacher writing on the blackboard, teachers ask questions, and teachers wander; for students, their specific behaviors include students raising their hands, standing up, and sitting down.

对于课堂场景下教师与学生的行为分析,其通过训练相应的分类器来实现行为的分类识别,具体方法如下:For the behavior analysis of teachers and students in classroom scenarios, the classification and recognition of behaviors are realized by training corresponding classifiers. The specific methods are as follows:

S1:训练分类器:S1: Train the classifier:

S11:获取课堂场景下大量的图片训练样本数据集,其包括各种需要识别的动作信息,如教师板书、教师提问、教师徘徊、学生举手、学生起立、学生坐下;S11: Acquire a large number of image training sample data sets in the classroom scene, which include various action information that needs to be identified, such as teacher writing on the blackboard, teacher asking questions, teacher wandering, students raising their hands, students standing up, and students sitting down;

S12:提取样本特征,如使用SIFT特征、HOG特征以及LBP特征。S12: Extract sample features, such as using SIFT features, HOG features, and LBP features.

S13:制作样本训练集,将样本提取后的特征与标签对应。S13: Create a sample training set, and map the extracted features of the samples to labels.

S14:建立分类器模型,如SVM分类器,K近邻分类器或者贝叶斯分类器。S14: Build a classifier model, such as SVM classifier, K-nearest neighbor classifier or Bayesian classifier.

S15:将样本训练集进行分类器训练,终止条件为达到预定精度或者达到预定训练次数。S15: Perform classifier training on the sample training set, and the termination condition is reaching a predetermined precision or reaching a predetermined number of training times.

S2:使用训练好的分类器进行课堂行为动作识别:S2: Classroom action action recognition using the trained classifier:

S21:获取实时视频,并采用帧差法或者背景法获取运动目标区域。S21: Obtain a real-time video, and use a frame difference method or a background method to obtain a moving target area.

S22:对运动目标区域进行特征提取,如SIFT特征、HOG特征以及LBP特征,与样本制作方式保持一致。S22: Perform feature extraction on the moving target area, such as SIFT features, HOG features, and LBP features, consistent with the sample production method.

S23:将提取的特征使用分类器进行分类,获得分类结果,对应的即可获取相应的行为动作类别。S23: Use the classifier to classify the extracted features to obtain a classification result, and correspondingly, the corresponding behavior action category can be obtained.

根据上述方案的本发明,所述的运动跟踪功能可用于教师行为的跟踪或者学生行为的跟踪;所述教师行为的跟踪,主要包括教师徘徊时云台摄像机的跟踪运动;所述学生行为的跟踪,主要包括学生站立时摄像头的变焦镜头特写以及学生坐下时摄像头的变焦镜头释放。运动跟踪是基于行为识别实现的,先根据视频分析DSP模块获取对应的行为动作,并根据具体的动作类型执行相应的跟踪类型,如动作类型为教师徘徊,则启动云台控制模块,控制云台运动,以达到运动跟踪的效果;如动作类型为学生起立,则同时控制云台控制模块与摄像头控制模块,首先云台控制模块控制云台运动到对应学生位于图像正中间,然后摄像头控制模块控制变焦镜头特写,将镜头拉近以达到学生特写的效果。According to the present invention of the above scheme, the motion tracking function can be used for the tracking of the teacher's behavior or the tracking of the student's behavior; the tracking of the teacher's behavior mainly includes the tracking motion of the PTZ camera when the teacher is wandering; the tracking of the student's behavior , mainly including the close-up of the zoom lens of the camera when the student is standing and the release of the zoom lens of the camera when the student is sitting down. Motion tracking is realized based on behavior recognition. First, the corresponding behavior actions are obtained according to the video analysis DSP module, and the corresponding tracking type is executed according to the specific action type. If the action type is teacher wandering, the PTZ control module is activated to control the PTZ. Movement to achieve the effect of motion tracking; if the action type is the student standing up, control the PTZ control module and the camera control module at the same time, first the PTZ control module controls the PTZ to move until the corresponding student is in the middle of the image, and then the camera control module controls Zoom lens close-up, zoom in to achieve the effect of close-up of students.

根据上述方案的本发明,所述的教学录播一体机各个功能模块实现的流程图如图6所示,所述的一体机系统开启时,可以选择录制或者点播;系统选择录制时,一方面摄像头采集的音视频通过主处理单元进行音视频编码,然后音视频封装成flv格式视频,分别用于本地存储、网络直播和本地播放,另一方面摄像头采集的视频信号用于视频运动分析,分析的结果用来运动跟踪;系统选择点播时,通过4G网络访问本地存储,选择需要观看的录制记录,下载点播视频到本地,完成播放。According to the present invention of the above scheme, the flow chart of the realization of each functional module of the teaching recording and broadcasting integrated machine is shown in FIG. 6 , when the integrated machine system is turned on, recording or on-demand can be selected; when the system chooses recording, on the one hand The audio and video collected by the camera are encoded by the main processing unit, and then the audio and video are encapsulated into flv format video, which are respectively used for local storage, network live broadcast and local playback. On the other hand, the video signal collected by the camera is used for video motion analysis and analysis. The result is used for motion tracking; when the system selects on-demand, it accesses the local storage through the 4G network, selects the recording record to be watched, downloads the on-demand video to the local, and completes the playback.

以上所述仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the scope of the present invention. Any equivalent structure or equivalent process transformation made by using the contents of the description and drawings of the present invention, or directly or indirectly applied to other related All technical fields are similarly included in the scope of patent protection of the present invention.

Claims (9)

1. A cloud deck camera shooting intelligent analysis teaching recording and broadcasting integrated machine is characterized by comprising a cloud deck and a camera unit carried on the cloud deck;
the camera unit comprises a camera, a host processing module 4G network module, a video analysis DSP module, a camera control module and a holder control module;
the camera is used for collecting video signals and audio signals of a teaching scene;
the host processing module is used for receiving the video signal and the audio signal, coding and packaging the video and the audio, and the packaged audio and video are stored locally on the one hand and are pushed out in a streaming media form through the 4G network module on the other hand;
the video analysis DSP module is used for receiving the video signals, classifying and identifying targets in the video signals, outputting a motion control signal to the holder control module and outputting a camera focusing signal to the camera control module according to the classification and identification results, and accordingly tracking the targets in real time;
the camera control module is used for controlling the focusing of the camera;
the holder control module is used for controlling the movement of the holder;
the classification and identification of the target in the video signal are realized by training a corresponding classifier, and the specific method is as follows:
s1: training a classifier:
s11: acquiring a picture training sample data set in a classroom scene, wherein the data set comprises various action information needing to be identified: teacher writing on blackboard, teacher asking questions, teacher loitering, student lifting hands, student standing up, student sitting down;
s12: extracting sample characteristics;
s13: making a sample training set, and corresponding the extracted features of the sample to the label;
s14: establishing a classifier model;
s15: carrying out classifier training on the sample training set, wherein the termination condition is that the preset precision is reached or the preset training times are reached;
s2: using the trained classifier to classify and recognize the classroom behavior and actions:
s21: acquiring a real-time video and a moving target area;
s22: extracting the characteristics of the moving target area;
s23: and classifying the extracted features by using the classifier obtained by training in the step S1 to obtain a classification result, and correspondingly obtaining the corresponding behavior and action category.
2. The pan-tilt-zoom-camera intelligent analysis teaching recording and broadcasting all-in-one machine as claimed in claim 1, wherein in the video analysis DSP module, the classification and identification of the targets comprises identification of teachers and students and identification of specific behaviors of teachers and students in classroom scenes.
3. The intelligent cloud deck camera analysis, teaching and recording integrated machine as claimed in claim 2, wherein specific behaviors of the teacher comprise teacher writing, teacher asking questions and teacher wandering; specific behaviors for a student include the student lifting his hands, standing up and sitting down.
4. The pan-tilt-zoom-camera intelligent analysis teaching recording and broadcasting all-in-one machine according to claim 3, wherein the real-time tracking of the target comprises the tracking of teacher behaviors or the tracking of student behaviors; the teacher behavior tracking mainly comprises the tracking motion of a pan-tilt camera when the teacher wanders; the tracking of the student behaviors mainly comprises the close-up of a zoom lens of a camera when a student stands and the release of the zoom lens of the camera when the student sits down.
5. The pan-tilt-zoom-camera intelligent analysis teaching and recording all-in-one machine according to claim 1, wherein in the step S12, the SIFT feature, the HOG feature and the LBP feature are used for extracting the sample features.
6. The tripod head camera shooting intelligent analysis teaching recording and broadcasting all-in-one machine as claimed in claim 1, wherein in the S14, an SVM classifier, a K neighbor classifier or a bayesian classifier is adopted as a classifier model.
7. The intelligent analysis teaching recording and broadcasting integrated machine with the pan-tilt camera shooting function as claimed in claim 1, wherein the camera unit further comprises a local data transmission module, and the audio and video packaged by the host processing module is directly transmitted to a local display through the local data transmission module.
8. The pan-tilt camera shooting intelligent analysis teaching recording and broadcasting all-in-one machine as claimed in claim 1, wherein the 4G network module can directly access the Internet, and a user directly accesses real-time streaming media through an external network.
9. The integrated machine of claim 1, wherein the cradle head is a suspended integrated cradle head, the horizontal movement of which adopts a worm and gear transmission mode, and the pitch movement of which adopts a planet gear transmission mode.
CN201910588415.8A 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine Expired - Fee Related CN110266984B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910588415.8A CN110266984B (en) 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910588415.8A CN110266984B (en) 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

Publications (2)

Publication Number Publication Date
CN110266984A CN110266984A (en) 2019-09-20
CN110266984B true CN110266984B (en) 2020-12-18

Family

ID=67923717

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910588415.8A Expired - Fee Related CN110266984B (en) 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

Country Status (1)

Country Link
CN (1) CN110266984B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460379A (en) * 2022-09-05 2022-12-09 南京逸智网络空间技术创新研究院有限公司 Teaching recording and broadcasting guide system and method based on Haesi embedded platform

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101635835A (en) * 2008-07-25 2010-01-27 深圳市信义科技有限公司 Intelligent video monitoring method and system thereof
CN103905734A (en) * 2014-04-17 2014-07-02 苏州科达科技股份有限公司 Method and device for intelligent tracking and photographing
US9113212B2 (en) * 1998-05-06 2015-08-18 Tivo Inc. Simultaneous recording and playback of audio/video programs
CN204669511U (en) * 2015-05-04 2015-09-23 广州盈可视电子科技有限公司 A kind of automatic recorded broadcast tracking system of integration
CN105139702A (en) * 2015-10-14 2015-12-09 广州天莱软件科技有限公司 Recording and broadcasting system used for teaching and use method thereof
CN106096666A (en) * 2016-06-24 2016-11-09 惠州紫旭科技有限公司 A kind of method and apparatus reducing recording and broadcasting system students ' behavior analysis erroneous judgement
CN205827430U (en) * 2016-04-19 2016-12-21 深圳正谱云教育技术有限公司 Camera to automatically track system based on single-lens image Dynamic Recognition
CN106803913A (en) * 2017-03-10 2017-06-06 武汉东信同邦信息技术有限公司 A kind of detection method and its device of the action that taken the floor for Auto-Sensing student
CN107105207A (en) * 2017-06-09 2017-08-29 北京深瞐科技有限公司 Target monitoring method, target monitoring device and video camera
CN108229352A (en) * 2017-12-21 2018-06-29 上海交通大学 A kind of standing detection method based on deep learning

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9113212B2 (en) * 1998-05-06 2015-08-18 Tivo Inc. Simultaneous recording and playback of audio/video programs
CN101635835A (en) * 2008-07-25 2010-01-27 深圳市信义科技有限公司 Intelligent video monitoring method and system thereof
CN103905734A (en) * 2014-04-17 2014-07-02 苏州科达科技股份有限公司 Method and device for intelligent tracking and photographing
CN204669511U (en) * 2015-05-04 2015-09-23 广州盈可视电子科技有限公司 A kind of automatic recorded broadcast tracking system of integration
CN105139702A (en) * 2015-10-14 2015-12-09 广州天莱软件科技有限公司 Recording and broadcasting system used for teaching and use method thereof
CN205827430U (en) * 2016-04-19 2016-12-21 深圳正谱云教育技术有限公司 Camera to automatically track system based on single-lens image Dynamic Recognition
CN106096666A (en) * 2016-06-24 2016-11-09 惠州紫旭科技有限公司 A kind of method and apparatus reducing recording and broadcasting system students ' behavior analysis erroneous judgement
CN106803913A (en) * 2017-03-10 2017-06-06 武汉东信同邦信息技术有限公司 A kind of detection method and its device of the action that taken the floor for Auto-Sensing student
CN107105207A (en) * 2017-06-09 2017-08-29 北京深瞐科技有限公司 Target monitoring method, target monitoring device and video camera
CN108229352A (en) * 2017-12-21 2018-06-29 上海交通大学 A kind of standing detection method based on deep learning

Also Published As

Publication number Publication date
CN110266984A (en) 2019-09-20

Similar Documents

Publication Publication Date Title
CN112562433B (en) A working method of 5G strong interactive remote delivery teaching system based on holographic terminal
Morgado et al. Learning representations from audio-visual spatial alignment
CN113691836B (en) Video template generation method, video generation method and device and electronic equipment
CN107945592B (en) Synchronous mutual-aid classroom teaching system
CN202601002U (en) A recording and playing system with manual and automatic operations
CN103905734A (en) Method and device for intelligent tracking and photographing
CN107135333A (en) A kind of teaching writing/playing system
CN207382443U (en) A kind of intelligent teaching recording and broadcasting system
CN114638732A (en) An artificial intelligence intelligent education platform and its application
US20220335246A1 (en) System And Method For Video Processing
CN110266984B (en) A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine
CN104469304A (en) Intelligent recording and playing system for performance training
CN112040256A (en) A method and system for video annotation of live experimental teaching process
Binh et al. Detecting student engagement in classrooms for intelligent tutoring systems
CN107864354A (en) A kind of method of electronic whiteboard intelligence recorded broadcast
CN108831220A (en) A kind of interaction multimedia tutoring system based on speech recognition
CN119011753A (en) Teaching system and method based on AI intelligent tracking analysis
CN109862375B (en) Cloud recording and broadcasting system
CN208873311U (en) The man-machine robot system for teaching mode altogether
CN110933350A (en) Electronic cloud mirror recording and broadcasting system, method and device
CN204334840U (en) An intelligent teaching and recording system
CN118764693A (en) Method, device, equipment and storage medium for generating video blog
CN217543870U (en) Interactive teaching classroom system
CN113709364B (en) Camera identifying equipment and object identifying method
CN113596367A (en) Intelligent course recording and broadcasting system for experiment teaching

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20201218

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载