+

CN110266984A - A pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine - Google Patents

A pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine Download PDF

Info

Publication number
CN110266984A
CN110266984A CN201910588415.8A CN201910588415A CN110266984A CN 110266984 A CN110266984 A CN 110266984A CN 201910588415 A CN201910588415 A CN 201910588415A CN 110266984 A CN110266984 A CN 110266984A
Authority
CN
China
Prior art keywords
camera
pan
video
tilt
broadcasting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910588415.8A
Other languages
Chinese (zh)
Other versions
CN110266984B (en
Inventor
王宣银
杜双龙
莫奇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201910588415.8A priority Critical patent/CN110266984B/en
Publication of CN110266984A publication Critical patent/CN110266984A/en
Application granted granted Critical
Publication of CN110266984B publication Critical patent/CN110266984B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47202End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

本发明公开了一种云台摄像智能分析教学录播一体机,包括云台和搭载在云台上的摄像头单元;摄像头单元包括:摄像头,用于采集教学场景的音视频信号;主机处理模块,用于接收音视频信号,对音视频信号进行编码,封装,封装后的音视频一方面储存于本地,另一方面通过4G网络模块将音视频以流媒体形式推出;视频分析DSP模块,用于接收视频信号,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;本发明将摄像头与云台一体化,对课堂场景下特定目标行为的识别与分析,结合4G网络的实时传输,有助于进一步提升教学课堂的智能化程度。

The invention discloses a pan-tilt camera intelligent analysis teaching recording and broadcasting integrated machine, which includes a pan-tilt and a camera unit mounted on the pan-tilt; the camera unit includes: a camera for collecting audio and video signals of a teaching scene; a host processing module, It is used to receive audio and video signals, encode and encapsulate the audio and video signals. On the one hand, the encapsulated audio and video are stored locally, and on the other hand, the audio and video are released in the form of streaming media through the 4G network module; the video analysis DSP module is used for Receive the video signal, classify and recognize the target in the video signal, output the motion control signal to the pan/tilt control module according to the result of classification and recognition, and output the camera focus signal to the camera control module, so as to track the target in real time; the present invention uses the camera Integrated with the cloud platform, the identification and analysis of specific target behaviors in the classroom scene, combined with the real-time transmission of the 4G network, will help to further improve the intelligence of the teaching classroom.

Description

一种云台摄像智能分析教学录播一体机A pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine

技术领域technical field

本发明涉及智慧教学领域,尤其是一种云台摄像智能分析教学录播一体机。The invention relates to the field of intelligent teaching, in particular to a pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine.

技术背景technical background

随着计算机视觉与网络流媒体技术的不断发展,加之云计算、移动互联网以及大数据分析等技术的不断成熟,其与现代教育的深度融合,催生了新一代智能教学录播系统。With the continuous development of computer vision and network streaming media technology, coupled with the continuous maturity of cloud computing, mobile Internet and big data analysis technology, its deep integration with modern education has given birth to a new generation of intelligent teaching recording and broadcasting system.

智能录播系统一般采用摄像头与录播主机分离的方式,即前端采用独立的摄像机获取视频数据,并通过SDI与HDMI等方式将视频输入到录播主机内,录播主机处理视频数据后,一方面存储视频,另一方面通过WiFi或者以太网进行网络直播,该种模式下的直播大都仅限于局域网直播。The intelligent recording and broadcasting system generally adopts the method of separating the camera from the recording and broadcasting host, that is, the front end uses an independent camera to obtain video data, and inputs the video into the recording and broadcasting host through SDI and HDMI, etc. After the recording and broadcasting host processes the video data, a On the one hand, the video is stored, and on the other hand, the network broadcast is performed through WiFi or Ethernet. Most of the live broadcasts in this mode are limited to LAN live broadcasts.

摄像头与录播主机分离,导致其占用空间大,结构不够紧凑;基于WiFi或者以太网的网络直播模式,导致其适用场景受到极大的限制;以往录播主机只负责录制与直播,并没有集成相应的智能识别算法,在后期处理时还需花费额外的时间。The camera is separated from the recording and broadcasting host, resulting in a large space occupation and insufficient compact structure; the network live broadcast mode based on WiFi or Ethernet has greatly restricted its applicable scenarios; in the past, the recording and broadcasting host was only responsible for recording and live broadcasting, and did not integrate The corresponding intelligent recognition algorithm will take additional time in post-processing.

发明内容Contents of the invention

鉴于以上问题,本发明的目的是提供了一种云台摄像智能分析教学录播一体机,采用了一体化的解决方案,可实现在广域网内的课堂实时直播以及课堂点播,与此同时还能实现课堂场景下目标与行为的自动分析并完成教师与学生的运动实时跟踪。In view of the above problems, the purpose of the present invention is to provide a cloud platform camera intelligent analysis teaching recording and broadcasting all-in-one machine, which adopts an integrated solution, which can realize real-time live broadcast and classroom on-demand in the wide area network, and at the same time can also Realize the automatic analysis of goals and behaviors in the classroom scene and complete the real-time tracking of the movement of teachers and students.

本发明的具体技术方案如下:Concrete technical scheme of the present invention is as follows:

一种云台摄像智能分析教学录播一体机,包括云台和搭载在云台上的摄像头单元;A pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine, comprising a pan-tilt and a camera unit mounted on the pan-tilt;

所述摄像头单元包括摄像头、主机处理模块4G网络模块、视频分析DSP模块、摄像头控制模块、云台控制模块;The camera unit includes a camera, a host processing module 4G network module, a video analysis DSP module, a camera control module, and a cloud platform control module;

所述摄像头,用于采集教学场景的视频信号和音频信号;The camera is used to collect video signals and audio signals of teaching scenes;

所述主机处理模块,用于接收视频信号和音频信号,对视频和音频进行编码,封装,封装后的音视频一方面储存于本地,另一方面通过4G网络模块将音视频以流媒体形式推出;The host processing module is used to receive video signals and audio signals, encode the video and audio, and package them. On the one hand, the packaged audio and video are stored locally; ;

所述视频分析DSP模块,用于接收视频信号,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;Described video analysis DSP module, is used for receiving video signal, carries out classification recognition to the target in video signal, according to the result of classification recognition, output motion control signal to cloud platform control module, output camera focusing signal to camera control module, thereby to Real-time tracking of targets;

摄像头控制模块,用于控制摄像头的对焦;The camera control module is used to control the focus of the camera;

云台控制模块,用于控制云台的运动。The pan-tilt control module is used to control the motion of the pan-tilt.

进一步的,在视频分析DSP模块中,对目标的分类识别包括教师与学生的身份识别以及教师与学生在课堂场景下的特定行为的识别。Furthermore, in the video analysis DSP module, the classification and identification of targets includes the identification of teachers and students and the identification of specific behaviors of teachers and students in classroom scenes.

进一步的,针对教师,其特定行为包括教师板书、教师提问以及教师徘徊;针对学生,其特定行为包括学生举手、起立和坐下。Further, for teachers, the specific behaviors include writing on the blackboard, asking questions, and wandering; for students, the specific behaviors include raising hands, standing up and sitting down.

进一步的,对目标进行实时跟踪包括教师行为的跟踪或者学生行为的跟踪;所述教师行为的跟踪,主要包括教师徘徊时云台摄像机的跟踪运动;所述学生行为的跟踪,主要包括学生站立时摄像头的变焦镜头特写以及学生坐下时摄像头的变焦镜头释放。Further, the real-time tracking of the target includes the tracking of the teacher's behavior or the tracking of the student's behavior; the tracking of the teacher's behavior mainly includes the tracking movement of the pan-tilt camera when the teacher is wandering; the tracking of the student's behavior mainly includes when the student is standing. A close-up of the camera's zoom and a release of the camera's zoom while the student is seated.

进一步的,所述对视频信号中的目标进行分类识别,通过训练相应的分类器来实现,具体方法如下:Further, the classification and recognition of the target in the video signal is realized by training a corresponding classifier, and the specific method is as follows:

S1:训练分类器:S1: Train the classifier:

S11:获取课堂场景下图片训练样本数据集,数据集中包括各种需要识别的动作信息,动作信息:教师板书、教师提问、教师徘徊、学生举手、学生起立、学生坐下;S11: Obtain the image training sample data set in the classroom scene. The data set includes various action information that needs to be recognized. Action information: teacher writing on blackboard, teacher asking questions, teacher wandering, students raising hands, students standing up, students sitting down;

S12:提取样本特征;S12: Extract sample features;

S13:制作样本训练集,将样本提取后的特征与标签对应;S13: Make a sample training set, and correspond the features extracted from the samples to the labels;

S14:建立分类器模型;S14: Establish a classifier model;

S15:将样本训练集进行分类器训练,终止条件为达到预定精度或者达到预定训练次数;S15: Perform classifier training on the sample training set, and the termination condition is to reach a predetermined accuracy or reach a predetermined number of training times;

S2:使用训练好的分类器进行课堂行为动作分类识别:S2: Use the trained classifier to classify and recognize classroom behavior actions:

S21:获取实时视频,并获取运动目标区域;S21: Obtain real-time video, and obtain a moving target area;

S22:对运动目标区域进行特征提取;S22: Perform feature extraction on the moving target area;

S23:将提取的特征使用步骤S1训练得到的分类器进行分类,获得分类结果,对应的即可获取相应的行为动作类别。S23: Use the classifier trained in step S1 to classify the extracted features to obtain classification results, and corresponding behavior categories can be obtained accordingly.

进一步的,所述S12中,提取样本特征使用SIFT特征、HOG特征以及LBP特征。Further, in the S12, extracting sample features uses SIFT features, HOG features and LBP features.

进一步的,所述S14中,分类器模型采用SVM分类器,K近邻分类器或贝叶斯分类器。Further, in S14, the classifier model adopts SVM classifier, K nearest neighbor classifier or Bayesian classifier.

进一步的,所述摄像头单元还包括本地数据传输模块,主机处理模块封装后的音视频通过本地数据传输模块直接传输到本地显示。Further, the camera unit also includes a local data transmission module, and the audio and video packaged by the host processing module are directly transmitted to the local display through the local data transmission module.

进一步的,所述4G网络模块能直接访问互联网,用户通过外网直接访问实时流媒体。Further, the 4G network module can directly access the Internet, and users can directly access real-time streaming media through the external network.

进一步的,所述的云台为悬挂式的一体化云台,其水平方向运动采用蜗轮蜗杆传动方式,俯仰方向运动采用行星轮传动方式。Further, the above-mentioned pan-tilt is a suspended integrated pan-tilt, its horizontal movement adopts a worm gear transmission mode, and its pitch direction movement adopts a planetary gear transmission mode.

相对于现有技术,本发明的有益效果如下:Compared with the prior art, the beneficial effects of the present invention are as follows:

1、采用了一体化录播主机解决方案,解决了以往录播主机系统占用空间大,以及繁琐的安装与配置流程。1. The integrated recording and broadcasting host solution is adopted, which solves the large space occupation of the previous recording and broadcasting host system and the cumbersome installation and configuration process.

2、使用了4G网络进行直播流媒体传输,更加方便用户的随时随地访问。2. The 4G network is used for live streaming media transmission, which is more convenient for users to access anytime and anywhere.

3、配置了课堂场景下专用的教师与学生分析摄像头,智能追踪教师学生行为,极大地提高了教育课堂的智能化程度。3. Equipped with a dedicated teacher and student analysis camera in the classroom scene, intelligently tracking the behavior of teachers and students, which greatly improves the intelligence of the educational classroom.

附图说明Description of drawings

图1为一体化云台整体结构图;Fig. 1 is the overall structural diagram of the integrated platform;

图2为一体化云台拆解示意图;Figure 2 is a schematic diagram of the disassembly of the integrated gimbal;

图3为一体化云台减速结构示意图;Fig. 3 is a schematic diagram of the integrated pan/tilt deceleration structure;

图4为一体化录播像头主板的具体结构框图;Fig. 4 is the specific structural block diagram of the mainboard of the integrated recording and broadcasting camera;

图5为一体化录播摄像头主板的结构示意图;Fig. 5 is a structural schematic diagram of the mainboard of the integrated recording and broadcasting camera;

图6为教学录播一体机功能流程图;Figure 6 is a functional flow chart of the teaching recording and broadcasting all-in-one machine;

其中:in:

1:摄像头单元 2:云台1: Camera unit 2: PTZ

3:俯仰运动输出端盖 4:右侧端盖3: Pitch motion output end cover 4: Right end cover

5:右侧摆臂 6:云台三通5: Right side swing arm 6: PTZ tee

7:连接件 8:轴承7: Connector 8: Bearing

9:云台支架 10:蜗轮轴9: Pan-tilt bracket 10: Worm gear shaft

11:云台减速箱 12:左侧摆臂11: Gimbal reducer 12: Left swing arm

13:俯仰运动动力输入 14:水平运动动力输入13: Pitching motion power input 14: Horizontal motion power input

15:俯仰运动动力输出 16:水平运动动力输出15: Pitching motion power output 16: Horizontal motion power output

具体实施方式Detailed ways

下面结合附图举例对本发明做更详细的描述:The present invention is described in more detail below in conjunction with accompanying drawing example:

如图1所示,一种云台摄像智能分析教学录播一体机,包括云台2和搭载在云台2上的摄像头单元1。As shown in FIG. 1 , a pan/tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine includes a pan/tilt 2 and a camera unit 1 mounted on the pan/tilt 2 .

根据上述方案的本发明,所述的云台2为悬挂式的一体化云台,如图2所示,其水平方向运动采用蜗轮蜗杆传动方式,蜗轮固定于蜗轮轴10上,蜗杆相对于蜗轮做旋转运动,即云台的水平转动,其中云台支架9上的蜗轮轴10与轴承8采用过盈配合,轴承8与连接件7也采用过盈配合,连接件7与云台三通6采用螺纹配合;云台俯仰方向运动采用行星轮传动方式,行星轮与俯仰运动输出端盖3采用内外齿轮啮合方式,行星小齿轮中心轴位置不变,俯仰运动输出端盖3相对其中心轴做旋转运动,驱动左侧摆臂12与右侧摆臂5摆动,从而带动摄像头单元1做俯仰运动,右侧端盖4用于辅助定位俯仰运动输出端盖3,并承载一定的载荷。According to the present invention of above-mentioned scheme, described cloud platform 2 is the integral cloud platform of suspended type, as shown in Figure 2, its horizontal motion adopts worm gear transmission mode, and worm gear is fixed on the worm gear shaft 10, and worm screw is relative to worm gear Rotating motion, that is, the horizontal rotation of the pan/tilt, wherein the worm gear shaft 10 on the pan/tilt bracket 9 and the bearing 8 adopt an interference fit, and the bearing 8 and the connecting piece 7 also adopt an interference fit, and the connecting piece 7 and the pan/tilt tee 6 Thread fit is adopted; the motion of the pan/tilt in the pitch direction adopts planetary gear transmission, and the planetary gear and the output end cover 3 of the pitching motion adopt an internal and external gear meshing mode. The rotation movement drives the left swing arm 12 and the right swing arm 5 to swing, thereby driving the camera unit 1 to do a pitching motion, and the right end cover 4 is used to assist in positioning the output end cover 3 of the pitching motion, and carries a certain load.

根据上述方案的本发明,所述云台减速箱11作为一体化云台的核心部件,具体结构如图3所示,中心位置电机作为俯仰运动动力输入13,驱动行星轮系运动,行星轮机构作为俯仰运动动力输出15,带动摄像头单元1做俯仰运动;偏心位置电机作为水平运动动力输入14,带动蜗杆转动作为水平运动动力输出16,因蜗轮固定不动,最后蜗杆围绕蜗轮转动,即摄像头单元1的水平运动;蜗轮蜗杆传动与行星轮传动都有传动比大,且结构紧凑的优点,非常适合一体化云台应用场景。According to the present invention of above-mentioned scheme, described pan-tilt reduction box 11 is as the core component of integrated pan-tilt, and specific structure is as shown in Figure 3, and center position motor is used as pitch motion power input 13, drives planetary gear train motion, and planetary gear mechanism As the pitching motion power output 15, it drives the camera unit 1 to do the pitching motion; the eccentric position motor is used as the horizontal motion power input 14, and drives the worm to rotate as the horizontal motion power output 16, because the worm wheel is fixed, and finally the worm rotates around the worm wheel, that is, the camera unit 1 horizontal movement; both worm gear transmission and planetary gear transmission have the advantages of large transmission ratio and compact structure, which are very suitable for the application scene of the integrated pan/tilt.

根据上述方案的本发明,所述的摄像头单元的主板具体结构框图如图4所示,该主板包含音频信号接口、视频信号接口、主机处理模块、硬盘储存接口、4G网络模块、本地数据传输模块、视频分析DSP模块、摄像头控制模块和云台控制模块;所述视频信号和音频信号分别从摄像头获取并传递给主机处理模块,主机处理模块对音频和视频进行编码,编码后的音视频由主机处理模块继续对其进行封装,封装后的视频一方面通过硬盘存储接口储存于本地,另一方面通过4G网络模块将网络流推出,另一方面可通过本地数据传输模块直接无延时、无损失的传输到本地显示;视频信号同时传递给视频分析DSP模块,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;摄像头控制模块,用于控制摄像头的对焦;云台控制模块,用于控制云台的运动。According to the present invention of the above-mentioned scheme, the specific structural block diagram of the motherboard of the camera unit is as shown in Figure 4, the motherboard includes an audio signal interface, a video signal interface, a host processing module, a hard disk storage interface, a 4G network module, and a local data transmission module , video analysis DSP module, camera control module and pan-tilt control module; Described video signal and audio signal are obtained from camera respectively and delivered to host processing module, and host processing module encodes audio frequency and video, and the audio-video after encoding is by host The processing module continues to encapsulate it. On the one hand, the encapsulated video is stored locally through the hard disk storage interface, on the other hand, the network stream is pushed out through the 4G network module, and on the other hand, it can be directly transmitted through the local data transmission module without delay or loss. The video signal is transmitted to the local display; the video signal is transmitted to the video analysis DSP module at the same time, and the target in the video signal is classified and recognized. According to the result of classification and recognition, the motion control signal is output to the pan/tilt control module, and the camera focus signal is output to the camera control module. , so as to track the target in real time; the camera control module is used to control the focus of the camera; the pan-tilt control module is used to control the movement of the pan-tilt.

根据上述方案的本发明,提供一种较佳实施例一体化录播摄像头主板结构示意图,如图5所示,用于视频处理功能、网络直播功能、录制存储功能、视频点播功能的主机处理模块,其主处理芯片优选海思HI3531A芯片;用于4G网络功能的4G网络模块,其芯片优选MT7620系列芯片,通过SIM卡提供数据达到4G网络传输;用于视频分析功能和运动跟踪功能的视频分析DSP模块,其芯片优选TMS320系列芯片;用于云台控制功能的云台控制模块和摄像头控制模块,其芯片优选STM32系列芯片,通过两路RS232分别控制云台和摄像头相关运动;本地数据传输模块一方面可直接通过HDMI接口向外传输数据,另一方面可通过GV7600芯片并行输出转串行输出,再通过SDI接口向外传输数据;除此之外,摄像头主板提供SATA接口和SD卡接口,用于本地视频储存。According to the present invention of above-mentioned scheme, provide a kind of structural schematic diagram of mainboard of integrated recording and broadcasting camera of preferred embodiment, as shown in Figure 5, be used for the host computer processing module of video processing function, network live broadcast function, recording storage function, video on demand function , the main processing chip is preferably Hisilicon HI3531A chip; the 4G network module used for 4G network function, the chip is preferably MT7620 series chip, and the data provided by the SIM card can achieve 4G network transmission; video analysis for video analysis function and motion tracking function DSP module, whose chip is preferably TMS320 series chip; the PTZ control module and camera control module used for PTZ control function, its chip is preferably STM32 series chip, which controls the related movement of PTZ and camera respectively through two RS232 channels; local data transmission module On the one hand, the data can be transmitted directly through the HDMI interface, on the other hand, the parallel output can be converted to serial output through the GV7600 chip, and then the data can be transmitted externally through the SDI interface; in addition, the main board of the camera provides a SATA interface and an SD card interface. For local video storage.

根据上述方案的本发明,所述网络直播功能基于4G网络模块访问互联网,用户可以通过外网直接访问该录播主机的实时流媒体,录播主机采用RTMP实时流媒体协议传输音视频实时流,其基本流程如下:According to the present invention of above-mentioned scheme, described live network function is based on 4G network module access Internet, and the user can directly visit the real-time streaming media of this recording and broadcasting host through external network, and recording and broadcasting host adopts RTMP real-time streaming media protocol to transmit audio and video real-time streaming, The basic process is as follows:

S01:录播主机通过视频处理模块获取实时音视频流,并且将其编码成特定的格式,音频编码成AAC格式,视频编码成H.264格式。S01: The recording and broadcasting host obtains real-time audio and video streams through the video processing module, and encodes them into a specific format, the audio is encoded into the AAC format, and the video is encoded into the H.264 format.

S02:将编码后的音视频数据按照FLV格式封装,使其符合RTMP传输标准。S02: Encapsulate the encoded audio and video data according to the FLV format, so as to conform to the RTMP transmission standard.

S03:使用LibRTMP流媒体推流框架,将实时流推送到RTMP流媒体服务器。S03: Use the LibRTMP streaming media streaming framework to push the real-time stream to the RTMP streaming media server.

S04:RTMP流媒体播放器播放实时流媒体。S04: RTMP streaming media player to play live streaming media.

根据上述方案的本发明,视频分析DSP模块对视频信号中的目标进行分类识别。According to the present invention of the above solution, the video analysis DSP module classifies and recognizes the targets in the video signal.

对目标的分类识别包括教师与学生的身份识别以及教师与学生在课堂场景下的特定行为的识别。The classification and identification of targets includes the identification of teachers and students and the identification of specific behaviors of teachers and students in classroom scenarios.

针对教师,其特定行为包括教师板书、教师提问以及教师徘徊;针对学生,其特定行为包括学生举手、起立和坐下。For teachers, the specific behaviors include writing on the blackboard, asking questions, and wandering; for students, the specific behaviors include raising hands, standing up and sitting down.

对于课堂场景下教师与学生的行为分析,其通过训练相应的分类器来实现行为的分类识别,具体方法如下:For the behavior analysis of teachers and students in the classroom scene, it realizes the classification and recognition of behavior by training the corresponding classifier. The specific method is as follows:

S1:训练分类器:S1: Train the classifier:

S11:获取课堂场景下大量的图片训练样本数据集,其包括各种需要识别的动作信息,如教师板书、教师提问、教师徘徊、学生举手、学生起立、学生坐下;S11: Obtain a large number of picture training sample data sets in classroom scenes, which include various action information that needs to be recognized, such as teacher writing on the blackboard, teacher asking questions, teacher wandering, students raising their hands, students standing up, and students sitting down;

S12:提取样本特征,如使用SIFT特征、HOG特征以及LBP特征。S12: Extract sample features, such as using SIFT features, HOG features, and LBP features.

S13:制作样本训练集,将样本提取后的特征与标签对应。S13: Make a training set of samples, and map the features extracted from the samples to the labels.

S14:建立分类器模型,如SVM分类器,K近邻分类器或者贝叶斯分类器。S14: Establish a classifier model, such as an SVM classifier, a K-nearest neighbor classifier or a Bayesian classifier.

S15:将样本训练集进行分类器训练,终止条件为达到预定精度或者达到预定训练次数。S15: Perform classifier training on the sample training set, and the termination condition is reaching a predetermined accuracy or reaching a predetermined number of training times.

S2:使用训练好的分类器进行课堂行为动作识别:S2: Use the trained classifier for classroom behavior recognition:

S21:获取实时视频,并采用帧差法或者背景法获取运动目标区域。S21: Obtain a real-time video, and acquire a moving target area by using a frame difference method or a background method.

S22:对运动目标区域进行特征提取,如SIFT特征、HOG特征以及LBP特征,与样本制作方式保持一致。S22: Perform feature extraction on the moving target area, such as SIFT features, HOG features, and LBP features, consistent with the sample production method.

S23:将提取的特征使用分类器进行分类,获得分类结果,对应的即可获取相应的行为动作类别。S23: Use the classifier to classify the extracted features, obtain the classification result, and obtain the corresponding behavior category accordingly.

根据上述方案的本发明,所述的运动跟踪功能可用于教师行为的跟踪或者学生行为的跟踪;所述教师行为的跟踪,主要包括教师徘徊时云台摄像机的跟踪运动;所述学生行为的跟踪,主要包括学生站立时摄像头的变焦镜头特写以及学生坐下时摄像头的变焦镜头释放。运动跟踪是基于行为识别实现的,先根据视频分析DSP模块获取对应的行为动作,并根据具体的动作类型执行相应的跟踪类型,如动作类型为教师徘徊,则启动云台控制模块,控制云台运动,以达到运动跟踪的效果;如动作类型为学生起立,则同时控制云台控制模块与摄像头控制模块,首先云台控制模块控制云台运动到对应学生位于图像正中间,然后摄像头控制模块控制变焦镜头特写,将镜头拉近以达到学生特写的效果。According to the present invention of above-mentioned scheme, described motion tracking function can be used for the tracking of teacher's behavior or the tracking of student's behavior; The tracking of described teacher's behavior mainly comprises the tracking movement of pan-tilt camera when teacher wanders; The tracking of described student's behavior , mainly including the zoom lens close-up of the camera when the student is standing and the zoom lens release of the camera when the student is sitting down. Motion tracking is realized based on behavior recognition. First, the corresponding behavior is obtained according to the video analysis DSP module, and the corresponding tracking type is executed according to the specific action type. Movement to achieve the effect of motion tracking; if the action type is for students to stand up, control the pan/tilt control module and camera control module at the same time. Zoom lens close-up, zoom in the lens to achieve the effect of close-up of students.

根据上述方案的本发明,所述的教学录播一体机各个功能模块实现的流程图如图6所示,所述的一体机系统开启时,可以选择录制或者点播;系统选择录制时,一方面摄像头采集的音视频通过主处理单元进行音视频编码,然后音视频封装成flv格式视频,分别用于本地存储、网络直播和本地播放,另一方面摄像头采集的视频信号用于视频运动分析,分析的结果用来运动跟踪;系统选择点播时,通过4G网络访问本地存储,选择需要观看的录制记录,下载点播视频到本地,完成播放。According to the present invention of the above-mentioned scheme, the flow chart of the implementation of each functional module of the teaching recording and broadcasting all-in-one machine is shown in Figure 6. When the all-in-one machine system is turned on, recording or on-demand can be selected; when the system selects recording, on the one hand The audio and video collected by the camera is encoded by the main processing unit, and then the audio and video are packaged into flv format video, which are used for local storage, webcast and local playback respectively. On the other hand, the video signal collected by the camera is used for video motion analysis, analysis The result is used for motion tracking; when the system selects on-demand, it accesses the local storage through the 4G network, selects the recording record to be watched, downloads the on-demand video to the local, and completes the playback.

以上所述仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the patent scope of the present invention. Any equivalent structure or equivalent process transformation made by using the description of the present invention and the contents of the accompanying drawings, or directly or indirectly used in other related All technical fields are equally included in the scope of patent protection of the present invention.

Claims (10)

1.一种云台摄像智能分析教学录播一体机,其特征在于,包括云台和搭载在云台上的摄像头单元;1. A cloud platform camera intelligent analysis teaching recording and broadcasting all-in-one machine is characterized in that it comprises a cloud platform and a camera unit mounted on the platform; 所述摄像头单元包括摄像头、主机处理模块4G网络模块、视频分析DSP模块、摄像头控制模块、云台控制模块;The camera unit includes a camera, a host processing module 4G network module, a video analysis DSP module, a camera control module, and a cloud platform control module; 所述摄像头,用于采集教学场景的视频信号和音频信号;The camera is used to collect video signals and audio signals of teaching scenes; 所述主机处理模块,用于接收视频信号和音频信号,对视频和音频进行编码,封装,封装后的音视频一方面储存于本地,另一方面通过4G网络模块将音视频以流媒体形式推出;The host processing module is used to receive video signals and audio signals, encode the video and audio, and encapsulate the audio and video. ; 所述视频分析DSP模块,用于接收视频信号,对视频信号中的目标进行分类识别,根据分类识别的结果,输出运动控制信号给云台控制模块,输出摄像头对焦信号给摄像头控制模块,从而对目标进行实时跟踪;Described video analysis DSP module, is used for receiving video signal, carries out classification recognition to the target in video signal, according to the result of classification recognition, output motion control signal to cloud platform control module, output camera focusing signal to camera control module, thereby to Real-time tracking of targets; 摄像头控制模块,用于控制摄像头的对焦;The camera control module is used to control the focus of the camera; 云台控制模块,用于控制云台的运动。The pan-tilt control module is used to control the motion of the pan-tilt. 2.根据权利要求1所述的一种云台摄像智能分析教学录播一体机,其特征在于,在视频分析DSP模块中,对目标的分类识别包括教师与学生的身份识别以及教师与学生在课堂场景下的特定行为的识别。2. a kind of pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine according to claim 1, is characterized in that, in the video analysis DSP module, the classification recognition to target comprises the identification of teacher and student and teacher and student Recognition of specific behaviors in classroom scenarios. 3.根据权利要求2所述的一种云台摄像智能分析教学录播一体机,其特征在于,针对教师,其特定行为包括教师板书、教师提问以及教师徘徊;针对学生,其特定行为包括学生举手、起立和坐下。3. A kind of PTZ camera intelligent analysis teaching recording and broadcasting all-in-one machine according to claim 2, characterized in that, for teachers, its specific behaviors include teachers writing on the blackboard, teachers asking questions and teachers wandering; for students, its specific behaviors include student Raise your hands, stand up and sit down. 4.根据权利要求3所述的一种云台摄像智能分析教学录播一体机,其特征在于,对目标进行实时跟踪包括教师行为的跟踪或者学生行为的跟踪;所述教师行为的跟踪,主要包括教师徘徊时云台摄像机的跟踪运动;所述学生行为的跟踪,主要包括学生站立时摄像头的变焦镜头特写以及学生坐下时摄像头的变焦镜头释放。4. a kind of pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine according to claim 3, is characterized in that, carrying out real-time tracking to target comprises the tracking of teacher's behavior or the tracking of student's behavior; The tracking of described teacher's behavior mainly Including the tracking movement of the pan-tilt camera when the teacher is wandering; the tracking of the student's behavior mainly includes the close-up of the zoom lens of the camera when the student is standing and the release of the zoom lens of the camera when the student is sitting down. 5.根据权利要求1-4任一项所述的一种云台摄像智能分析教学录播一体机,其特征在于,所述对视频信号中的目标进行分类识别,通过训练相应的分类器来实现,具体方法如下:5. according to claim 1-4 described in any one of claim 1-4, it is characterized in that, the object in the video signal is classified and identified, and the corresponding classifier is trained to To achieve, the specific method is as follows: S1:训练分类器:S1: Train the classifier: S11:获取课堂场景下图片训练样本数据集,数据集中包括各种需要识别的动作信息,动作信息:教师板书、教师提问、教师徘徊、学生举手、学生起立、学生坐下;S11: Obtain the image training sample data set in the classroom scene. The data set includes various action information that needs to be recognized. Action information: teacher writing on blackboard, teacher asking questions, teacher wandering, students raising hands, students standing up, students sitting down; S12:提取样本特征;S12: Extract sample features; S13:制作样本训练集,将样本提取后的特征与标签对应;S13: Make a sample training set, and correspond the features extracted from the samples to the labels; S14:建立分类器模型;S14: Establish a classifier model; S15:将样本训练集进行分类器训练,终止条件为达到预定精度或者达到预定训练次数;S15: Perform classifier training on the sample training set, and the termination condition is to reach a predetermined accuracy or reach a predetermined number of training times; S2:使用训练好的分类器进行课堂行为动作分类识别:S2: Use the trained classifier to classify and recognize classroom behavior actions: S21:获取实时视频,并获取运动目标区域;S21: Obtain real-time video, and obtain a moving target area; S22:对运动目标区域进行特征提取;S22: Perform feature extraction on the moving target area; S23:将提取的特征使用步骤S1训练得到的分类器进行分类,获得分类结果,对应的即可获取相应的行为动作类别。S23: Use the classifier trained in step S1 to classify the extracted features to obtain classification results, and corresponding behavior categories can be obtained accordingly. 6.根据权利要求6所述的一种云台摄像智能分析教学录播一体机,其特征在于,所述S12中,提取样本特征使用SIFT特征、HOG特征以及LBP特征。6. A kind of pan-tilt-camera intelligent analysis, teaching, recording and broadcasting all-in-one machine according to claim 6, characterized in that, in said S12, extracting sample features uses SIFT features, HOG features and LBP features. 7.根据权利要求6所述的一种云台摄像智能分析教学录播一体机,其特征在于,所述S14中,分类器模型采用SVM分类器,K近邻分类器或贝叶斯分类器。7. A kind of pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine according to claim 6, characterized in that, in said S14, the classifier model adopts SVM classifier, K-nearest neighbor classifier or Bayesian classifier. 8.根据权利要求1所述的一种云台摄像智能分析教学录播一体机,其特征在于,所述摄像头单元还包括本地数据传输模块,主机处理模块封装后的音视频通过本地数据传输模块直接传输到本地显示。8. A kind of pan-tilt camera intelligent analysis teaching recording and broadcasting all-in-one machine according to claim 1, characterized in that, the camera unit also includes a local data transmission module, and the audio and video after the host processing module encapsulation passes through the local data transmission module Transfer directly to local display. 9.根据权利要求1所述的一种云台摄像智能分析教学录播一体机,其特征在于,所述4G网络模块能直接访问互联网,用户通过外网直接访问实时流媒体。9. The all-in-one machine for intelligent analysis, teaching, recording and broadcasting of PTZ cameras according to claim 1, wherein the 4G network module can directly access the Internet, and the user directly accesses real-time streaming media through the external network. 10.根据权利要求1所述的一种云台摄像智能分析教学录播一体机,其特征在于,所述的云台为悬挂式的一体化云台,其水平方向运动采用蜗轮蜗杆传动方式,俯仰方向运动采用行星轮传动方式。10. The all-in-one machine for intelligent analysis, teaching, recording and broadcasting of a pan-tilt camera according to claim 1, wherein the pan-tilt is a suspended integrated pan-tilt, and its horizontal motion adopts a worm gear transmission mode, The movement in the pitch direction adopts planetary gear transmission.
CN201910588415.8A 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine Expired - Fee Related CN110266984B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910588415.8A CN110266984B (en) 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910588415.8A CN110266984B (en) 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

Publications (2)

Publication Number Publication Date
CN110266984A true CN110266984A (en) 2019-09-20
CN110266984B CN110266984B (en) 2020-12-18

Family

ID=67923717

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910588415.8A Expired - Fee Related CN110266984B (en) 2019-07-01 2019-07-01 A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine

Country Status (1)

Country Link
CN (1) CN110266984B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460379A (en) * 2022-09-05 2022-12-09 南京逸智网络空间技术创新研究院有限公司 Teaching recording and broadcasting guide system and method based on Haesi embedded platform

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101635835A (en) * 2008-07-25 2010-01-27 深圳市信义科技有限公司 Intelligent video monitoring method and system thereof
CN103905734A (en) * 2014-04-17 2014-07-02 苏州科达科技股份有限公司 Method and device for intelligent tracking and photographing
US9113212B2 (en) * 1998-05-06 2015-08-18 Tivo Inc. Simultaneous recording and playback of audio/video programs
CN204669511U (en) * 2015-05-04 2015-09-23 广州盈可视电子科技有限公司 A kind of automatic recorded broadcast tracking system of integration
CN105139702A (en) * 2015-10-14 2015-12-09 广州天莱软件科技有限公司 Recording and broadcasting system used for teaching and use method thereof
CN106096666A (en) * 2016-06-24 2016-11-09 惠州紫旭科技有限公司 A kind of method and apparatus reducing recording and broadcasting system students ' behavior analysis erroneous judgement
CN205827430U (en) * 2016-04-19 2016-12-21 深圳正谱云教育技术有限公司 Camera to automatically track system based on single-lens image Dynamic Recognition
CN106803913A (en) * 2017-03-10 2017-06-06 武汉东信同邦信息技术有限公司 A kind of detection method and its device of the action that taken the floor for Auto-Sensing student
CN107105207A (en) * 2017-06-09 2017-08-29 北京深瞐科技有限公司 Target monitoring method, target monitoring device and video camera
CN108229352A (en) * 2017-12-21 2018-06-29 上海交通大学 A kind of standing detection method based on deep learning

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9113212B2 (en) * 1998-05-06 2015-08-18 Tivo Inc. Simultaneous recording and playback of audio/video programs
CN101635835A (en) * 2008-07-25 2010-01-27 深圳市信义科技有限公司 Intelligent video monitoring method and system thereof
CN103905734A (en) * 2014-04-17 2014-07-02 苏州科达科技股份有限公司 Method and device for intelligent tracking and photographing
CN204669511U (en) * 2015-05-04 2015-09-23 广州盈可视电子科技有限公司 A kind of automatic recorded broadcast tracking system of integration
CN105139702A (en) * 2015-10-14 2015-12-09 广州天莱软件科技有限公司 Recording and broadcasting system used for teaching and use method thereof
CN205827430U (en) * 2016-04-19 2016-12-21 深圳正谱云教育技术有限公司 Camera to automatically track system based on single-lens image Dynamic Recognition
CN106096666A (en) * 2016-06-24 2016-11-09 惠州紫旭科技有限公司 A kind of method and apparatus reducing recording and broadcasting system students ' behavior analysis erroneous judgement
CN106803913A (en) * 2017-03-10 2017-06-06 武汉东信同邦信息技术有限公司 A kind of detection method and its device of the action that taken the floor for Auto-Sensing student
CN107105207A (en) * 2017-06-09 2017-08-29 北京深瞐科技有限公司 Target monitoring method, target monitoring device and video camera
CN108229352A (en) * 2017-12-21 2018-06-29 上海交通大学 A kind of standing detection method based on deep learning

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460379A (en) * 2022-09-05 2022-12-09 南京逸智网络空间技术创新研究院有限公司 Teaching recording and broadcasting guide system and method based on Haesi embedded platform

Also Published As

Publication number Publication date
CN110266984B (en) 2020-12-18

Similar Documents

Publication Publication Date Title
Morgado et al. Learning representations from audio-visual spatial alignment
CN112562433B (en) A working method of 5G strong interactive remote delivery teaching system based on holographic terminal
CN109698920B (en) Follow teaching system based on internet teaching platform
CN105306862B (en) A kind of scene video recording system based on 3D dummy synthesis technology, method and scene real training learning method
CN103905734A (en) Method and device for intelligent tracking and photographing
CN202601002U (en) A recording and playing system with manual and automatic operations
CN113691836A (en) Video template generation method, video generation method and device and electronic equipment
CN114638732A (en) An artificial intelligence intelligent education platform and its application
CN104469304A (en) Intelligent recording and playing system for performance training
CN107864354A (en) A kind of method of electronic whiteboard intelligence recorded broadcast
CN110266984B (en) A PTZ camera intelligent analysis teaching recording and broadcasting integrated machine
CN114339197A (en) Video playback test method, device and equipment
CN108831220A (en) A kind of interaction multimedia tutoring system based on speech recognition
CN113315980A (en) Intelligent live broadcast method and live broadcast Internet of things system
CN112235605A (en) Video processing system and video processing method
CN109862375B (en) Cloud recording and broadcasting system
CN110933350A (en) Electronic cloud mirror recording and broadcasting system, method and device
CN204334840U (en) An intelligent teaching and recording system
CN119011753A (en) Teaching system and method based on AI intelligent tracking analysis
CN118764693A (en) Method, device, equipment and storage medium for generating video blog
CN217543870U (en) Interactive teaching classroom system
CN107134178A (en) A kind of music initiation learning device and method based on augmented reality
CN209002068U (en) Intelligent cloud mirror video camera
CN103428441A (en) Course recording method and course recording device used for on-line teaching
CN203827424U (en) SDI gun camera supporting tracking teacher in education recorded broadcast

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20201218

CF01 Termination of patent right due to non-payment of annual fee
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载