key: cord-0028075-pi2ahmyo authors: Sun, Jia title: A Digital Feature Recognition Technology Used in Ballet Training Action Correction date: 2022-02-25 journal: Comput Intell Neurosci DOI: 10.1155/2022/7953172 sha: 67e66d1ddf248b4e6395e29d513174524e2b3ee2 doc_id: 28075 cord_uid: pi2ahmyo In order to improve the effect of ballet training, this paper combines digital technology to improve the motion recognition process, analyzes the imaging process, analyzes the system process combined with the human node model, and combines digital feature recognition technology to construct a ballet training motion correction system. Moreover, this paper inputs the ballet training action recognition results into the system and compares the standard actions to judge the rationality of the actions and builds a ballet training action correction system based on digital feature recognition. In addition, this paper designs experiments to conduct ballet training movements in the system effect evaluation. The experimental research verifies that the digital feature recognition technology proposed in this paper can play an important role in ballet movement recognition and has a good action correction effect. Ballet is a charming and refreshing art. Its elegance and beauty can be regarded as the essence of human civilization and progress, and it is praised as "the crown jewel of dance art" [1] . Its graceful dance, light jumping, and continuous toe rotation often bring people into a dreamlike artistic realm. Ballet originated in Italy during the European Renaissance, flourished in France, and flourished in Russia, and finally moved from Russia to the world. After hundreds of years of development, ballet has formed its own complete training system. In particular, it is unique in the scientific training methods of the correct sense of the feet, posture, various parts of the body, and the consciousness of movement performance [2] . e ballet training system is to improve the performer's physical ability and performance awareness through a series of complete programmatic body movement combination training. Moreover, it enables the dancers to freely and colorfully express the performers' rich thoughts and passions in various situations, thereby creating a brandnew artistic image [3] . e romantic ballet showed the body aesthetics so vividly that the criterion for judging this beauty has continued to this day. e basic skill training of Lei can be summarized as the four principles of opening, stretching, upright, and standing, which are basically consistent with the requirements of the competition rules for the body technical movement specifications of the artistic performances. Basic ballet training has a positive effect on the aesthetics and standardization of Latin dance. erefore, the basic training of ballet is very important for the study and training of Latin dance. At present, there are still many problems that need to be improved in actual teaching: firstly, there is no ballet training material suitable for special and unified sports dance; secondly, the teaching hours are generally too short to meet the special requirements; finally, the goal of ballet training is pertinent not strong, ignoring the training needs of each item. Only by further studying the value and role of basic ballet training in Latin dance teaching, strengthening the thinking of the degree of integration of basic dance and professional dance, and improving the awareness of basic dance services for special dances, can the teaching effect be better improved. Ballet dynamic technology is to improve the dancers' technical movements and physical abilities through a series of complete and standardized body movements, so as to better show the dance style and dance performance ability, so that the dancers can be in a variety of music rhythms. is kind of emotions can show graceful dance with ease, convey the most accurate emotional expression to the audience, show the most exquisite technical movements, and have the characteristics of science and standardization. Ballet dynamic techniques cover jumping, squatting, turning, and other techniques. In foreign countries, many modern dancers have been trained in ballet since they were young, so they have solid ballet dynamic skills, making the basic dance skills much better than ordinary people, and the performance of modern dance competitions is also very good. is article combines digital feature recognition technology to carry out ballet training movement correction analysis and combines the actual ballet movement to correct the actual needs for research and proposes a corresponding intelligent training correction system to improve the effect of ballet training. Literature [4] introduced the importance of ballet in the dance classroom and then analyzed the problems of unreasonable curriculum setting, single teaching method, and insufficient teachers in the reform of ballet classroom teaching. In response to these problems, corresponding solutions are put forward to help the ballet classroom to be better developed and play its role. Literature [5] pointed out that ballet is a must-have course for dance practitioners. It can not only affect and improve the beauty of people, but also make people have beauty of posture and temperament. It is precisely because of a scientific and standardized system such as ballet that the flowers of dance art can bloom more brilliantly. Literature [6] mentioned that the technical level of modern ballet can be divided into dynamic technology and static technology. Similarly, sports dance also includes dynamic movements and static movements. Dynamic movements require agility and coordination, which not only involves complex steps, but also requires physical sensations and delicate and changeable techniques. Literature [7] believed that the basic ballet training course is an indispensable course for dance majors, and it is also a major course in the ballet teaching system. e most prominent feature is that it not only trains various positions of the body, but also plays a comprehensive training role in the coordination of the entire body. Gao Yiyan said that dance is the art of the body, and the courses to exercise the dancers' bodies are very important. e reason why basic ballet training can be regarded as one of the compulsory basic courses for all dance majors is that basic ballet training is the longest and most scientific basic training course in history. e purpose of the basic ballet training course is to train students' muscles, skills, abilities, etc., to help students lay a foundation for future performances [8] . Literature [9] pointed out that there are several problems in the development process of dance movement in the emerging stage, such as the decline of dance movement athletes' physical fitness, low technical level, unnew arrangement, and poor completion, and proposed development strategies to improve dance movement items. First of all, establish a scientific and complete training plan, and insist on scientific training for a long time. Secondly, strengthen the coach's own quality and coaching ability, continue to promote and popularize dance movement training, and expand the number of reserve talents to provide talents for the national team. Literature [10] made a preliminary popularization of the origin and development form of dance movements to other scholars who did not understand the development of dance movements and let people gradually understand the development history and essential characteristics of dance movements. e article mentioned that both now and in the future, only the combination of sports and art can continuously create new results. In particular, modern people pursue a healthy lifestyle and the beauty of sports. erefore, it is necessary to strengthen the popularization of people to shape a healthy lifestyle through sports. Literature [11] proposes that only coaches should follow the latest trends in the development of dance movement items and, on the premise of accurately understanding and grasping the scoring rules, propose and implement corresponding training for the common problems of dance movement athletes at this stage and the athletes' physical conditions. Only plans and countermeasures can adapt to the rules of international dance movement competitions and promote the development of dance movement projects in the direction of art and competition that is difficult and beautiful. Literature [12] refers to the limitation of sports events on the road to the emergence of institutional development, forcing sports events to be unable to continue their development. Literature [13] writes that dance moves are still a young athletic event, so there is a lot of room for improvement. e complete set of dance moves is not only a superimposition of difficulty, but also a perfect match with the equipment. erefore, the individual characteristics of the project should be fully considered during training, combined with the athlete's physical ability, and appropriate auxiliary method training should be strengthened, including basic quality training such as flexibility, strength, speed, endurance, agility, etc. It also includes the characteristics of different equipment for training to integrate and complement each other. Literature [14] writes that coaches emphasize physical fitness while neglecting equipment skills in training. Poor proficiency directly affects the complete quality of the entire set. If the proficiency training in equipment movements, continuity, and coordinated movements is not strengthened, it will be restricted. For the progress of the dance movement project, [15] pointed out that it has begun to pay attention to the research of athlete selection, movement arrangement, physical training, etc. and strictly train outstanding athletes. Literature [16] writes that it is important to scientifically and rationally arrange the training plans for each stage of the small, medium, and large cycles, intersperse competitions, assessments, relaxation, and other links in the quality training, and combine the large and small cycles, so that the cycle training can achieve excellent results. Literature [17] emphasizes the balanced development of various difficult movements, attaches importance to difficult movements and balance movements, strictly regulates movement standards, strengthens the time of equipment training, participates in more international competitions, accumulates experience, and makes use of the good model of competition-driven training. Multiview dense matching uses homography to perform dual-view stereo matching between multiple cameras. e homography relationship can be expressed as follows: a point in a certain image plane can determine the position of its corresponding point in the second image plane through the homography induced by a certain plane. As shown in Figure 1 , a point x 1 in the image plane of the camera C 1 can be used to determine the position of the matching point x 2 in the image plane of the camera C 2 through the homography H induced by the π plane. e calculation method of the homography matrix H is as follows: We assume that the projection matrices of camera C 1 and camera C 2 are P 1 and P 2 , respectively. First, we backproject the plane point x, and the back-projection ray intersects the plane π at the three-dimensional point X, so there is en, we map the three-dimensional point X projection to the plane point x 2 and denote it as x 2 � P 2 X. At this time, the homography correspondence between the plane points x 1 and x 2 can be obtained as [18] We assume that the projection matrices of camera C 1 and camera C 2 are P 1 � K 1 [I | 0] and P 2 � K 2 [R | t], respectively, and the π plane can be expressed as π � (n T , 1) T (n is a certain unit vector). Any three-dimensional point on the projection ray of the plane point x1 can be expressed as x � (x 1 , η) T , where η is the parameterized coefficient of the point X on the projection ray. Since the three-dimensional point X is on the plane π at the same time, there is π T X � 0, which is transformed into X � (x 1 , −n T x 1 ) T . erefore, formula (1) can be further expressed as [19] It is composed of the internal parameter matrix of the camera, the relative external parameters between the cameras, and the plane parameters. e schematic diagram of the spatial plane scanning algorithm is shown in Figure 2 . In Figure 2 , we assume that there are a multicamera image I i (i � 1, . . . , n) and a series of parallel virtual planes Z j (j � 1, . . . , m) scanned along the Z axis in space. In the algorithm, each virtual plane is divided into cells of a certain size, and the cell size determines the accuracy of the threedimensional points in the reconstruction result. e principle of the spatial plane scanning algorithm is that the algorithm back-projects the image feature point x i , and the back-projected rays intersect the corresponding cell of each virtual plane. e algorithm performs back-projection operation on all image feature points and records the number of times that all cells are intersected. e algorithm counts the total number of intersections of each cell, and the cell with the number of intersections greater than the preset threshold is regarded as the three-dimensional point X corresponding to the multiview matching. At the same time, the image point x i (i � 1, . . . , n) corresponding to the threedimensional point transmission projection is the matching point of the multiview image. e image point x i (x, y) T and the three-dimensional point X � (X, Y, Z, 1) T in space are both representations of homogeneous coordinates. e process of projecting the point X in space onto a certain camera image plane I i is [20] x i � K r 1 r 2 r 3 t X. (3) K in the equation is the camera's internal parameter matrix. Crossover R � r 1 r 2 r 3 and t are the external parameters of the camera. Since the three-dimensional point X in space lies on the plane Z f , (3) can be replaced by In this way, the projective transformation is transformed into a homography transformation of two planes, and the homography matrix H j is [21] In order to calculate the intersection position of the back-projection ray of the image point and all virtual planes more effectively, we propose to precalculate the intersection (X 1 , Y 1 , Z 1 ) T of the back-projection ray of the image point and a certain virtual plane Z 1 . After that, we directly perform a homography transformation on (X 1 , Y 1 , Z 1 ) T to solve the intersection point (X j , Y j , Z j ) between the back-projected ray and other virtual planes, instead of retransforming the source image point. e calculation expression is We assume a pixel p and its surrounding neighborhood pixel square N(p); the intensity of the pixel is expressed as I(p). Nonparametric local transformation is to compare the intensity value I(p ′ ) of pixel p ′ (p ′ ∈ N(p)) in the neighborhood square matrix with I(p) and define the sign as On this basis, the Rank transformation is expressed as It refers to the number of pixels in the neighborhood square matrix whose pixel intensity value is less than the center pixel intensity value. When calculating region matching, the LI norm relationship is used for minimization, and the center pixel corresponding to the region with the smallest number after comparison is selected as the matching pixel. It directly performs hierarchical similarity comparison of color images and adopts a multiwindow comparison method to improve the matching accuracy in the case of discontinuous parallax. Different from the Rank transform, the idea of the Census transform is to compare the intensity value difference between the pixels in the neighborhood square matrix and the center pixel and express the comparison result as a binary bit string in order. e mathematical expression of this transformation is In the formula, the symbol ⊗ represents the concatenation of each value ξ(p ′ , p). e schematic diagram of the Census transformation is shown in Figure 3 , where the pixels in the red dashed frame in the figure are the neighborhood pixel square matrix N(p). Because the Census transform not only uses nonparametric binary characters to represent the attributes of the pixels, but also incorporates the order of the neighboring pixel attributes into the constraints, it is more robust than the Rank transform in the face of changes in external lighting conditions. After the Census transformation of the multiview image to be matched, the Hamming distance between the pixels to be matched is calculated to achieve multiview stereo matching. e Hamming distance refers to the number of different bits between two bit strings. It is very intuitive to express the degree of similarity between pixels. e method of using Hamming distance as the matching criterion can be expressed as In the formula, Cost(p d) represents the matching cost of the pixel p to be matched at the disparity d, and C 1 and C 2 are the bit strings of the two viewpoint images after the Census transformation. In the case of discontinuous parallax, usually a point-bypoint matching combined with global optimization is used for processing to reduce the erroneous matching caused by noise and obtain more accurate matching results. e global optimization in this method adds a smoothing term as an additional constraint condition on the basis of the matching cost, so as to punish the change of the neighborhood parallax, which can be expressed mathematically as When the neighborhood disparity change |D p − D p′ | is greater than or equal to one pixel, a constant penalty coefficient p is added to preserve the discontinuity of the disparity. e stereo matching problem combined with the global method becomes the problem of minimizing the global energy function E(D) and solving the parallax D. However, (11) is a two-dimensional global energy function, and it is a complicated process to minimize it. Although the idea of dynamic programming can be used to efficiently minimize one-dimensional image rows one by one, it is still difficult to achieve optimal association between rows. Different from the energy function of global matching, the energy function of semiglobal matching uses two smoothing coefficients p1 and p2 of different sizes to penalize different degrees of parallax changes, which is mathematically expressed as e p1 in formula (12) penalizes a small range of parallax (for example, the parallax is one pixel) and can adaptively preserve the inclined surface or curved surface, while p2, which penalizes large parallax, can preserve the discontinuous area. e most important thing is that the semiglobal matching adopts a new matching cost accumulation method, which equally accumulates the one-dimensional matching cost of all directions of the pixels, and uses the minimum cost sum of all directions as the final cumulative matching cost. e minimum cost of a path in a certain direction can be expressed as In the formula, C(p,d) represents the point-by-point matching cost, r represents the directional path, p−r is the pixel before the pixel in the r direction, and d represents the parallax. e final cumulative matching cost is expressed as e actual application process usually accumulates the minimum cost of 8 or 16 directional paths, as shown in Figure 4 . e paths in the eight directions are row accumulation, column accumulation, and diagonal accumulation. In addition to the rows, columns, and diagonals of the 16-direction paths, the accumulation method of the remaining 8 directions is to first perform row or column accumulation, then perform diagonal accumulation, and perform alternately. Finally, the cumulative matching cost energy function Cost agg (p, d) is minimized, and the matching disparity value d corresponding to each pixel is obtained, and the matching is completed. In the field of computer vision, applications such as image denoising, stereo image dense matching, and depth generation all have erroneous estimates due to external noise. For this reason, outliers are usually eliminated. e principle of outlier removal can be understood from the perspective of probability models: g(x) is a series of noisy measurement values; u(x) ∈ R M×N is the solution of the required model. According to Bayesian inference, the probability of obtaining the model solution u(x) optimized from the measurement value b is expressed as In formula (15) , p(g | u) is called the data generation model, and p(u) is called the prior model. e denominator p(g) is a constant when the measured value is known, so p(u | g) is proportional to p(g | u)p(u) and can be expressed as p(g | u) ∝ p(g | u)p(u). erefore, the target solution u can be solved by maximizing the posterior probability p(u | g): Most practical engineering applications will use global optimization methods to eliminate outliers. e global optimization method sets E(u) � −ln p(u | g) to convert the probability maximization problem to the energy minimization problem. ere are Furthermore, global optimization usually replaces the probability model with a general energy model, which consists of data items containing model solutions and regular items with smoothing effect, expressed as e data item E(u) � Ω ψ D (es(u | x), g(x))dx + τ Ω ψ D (∇u(x))dx in equation (19) is proportional to −ln p(u), and the regular term R(u) is proportional to D(u | g). e data item consists of the error function e induced by the model solution u(x) and the measured value g(x) and the positive penalty function ψ D , expressed as Computational Intelligence and Neuroscience e regular term is composed of the smooth function of the model solution u(x) and the positive penalty function v, expressed as R(u) � x∈Ω ψ R (s(u(x))). (20) e value of the regular term is small, which plays a role in preserving certain characteristics of the target solution. In the continuous domain, the smoothing of the model solution Finally, a variational global optimization energy model can be defined to solve the problem of removing outliers. e energy model is mathematically expressed as e solution method of formula (21) is to use the Euler-Lagrangian equation zE(u)/zu � 0 which minimizes the energy functional to find the minimum value of the model solution u(x). In particular, when both the data item and the regular item are convex, the model solution obtained by zE(u)/zu � 0 is the minimum value. When a convex function is not differentiable at a certain point, the problem of nondifferentiation is often solved by solving its corresponding convex conjugate. e specific method is as follows: by the Legendre-Fenchel transformation, the convex conjugate of the original function f(x) can be written as Among them, y is a dual variable, and the symbol <. . .> means calculating the inner product of the original variable x and the dual variable y, and the transformed function is differentiable in the entire domain y. erefore, the problem that the original Ll norm is not differentiable at x � 0 can be solved by convex coballoons. e convex conjugation of Ll cost |x| 1 is In the formula, δ(y) is the indicator function, and when ‖y‖ 1 ≤ 1, δ(y) � 0. At the same time, the dual form of the original LI norm can be written as Similarly, the convex conjugate combined with the quadratic cost x 2 is e dual forms of the original quadratic norm and the original Huber norm are We assume that the expression of a common saddle point problem is In the formula, x and Y are two-dimensional real vector spaces, and L is a continuous linear operation on X ⟶ Y. e symbol F * represents the convex conjugation of the function F. e saddle point problem can be regarded as the original problem: or the original dual description of the corresponding dual problem: For the optimization of this kind of saddle point problem, an iterative optimization method for solving the Computational Intelligence and Neuroscience original variable x and the dual variable y is given. First, we obtain partial derivatives of formula (27) to obtain the presolved subformulas of x and y: x � (I + ∇G) −1 (x − Ly), y � (I + ∇F) −1 (y + Lx). e symbol ∇ in the formula is the gradient operator. After that, the original variable and the dual variable are updated iteratively through the gradient descent/rise method. Taking the ROF denoising model as an example, the expression of the original energy model is It is composed of a regular term composed of L1 norm and a data term composed of quadratic norm, where u is the target solution, and f is the input image with noise. When ‖∇u‖ 1 in formula (31) is replaced with the form of formula (24), the original dual form of the ROF denoising model can be obtained: In the formula, p is a dual variable. e convex set R in formula (32) is defined as Among them, ‖p‖ ∞ � max i,j |p i,j | represents the discrete maximum norm. By comparing formula (32) with formula (16) , it is not difficult to see that the original dual form of the ROF denoising model is a saddle point problem. First, for the original dual form of the ROF denoising model (29), the partial derivative of the dual variable p is calculated, and the presolvent formula of the dual variable is obtained: en, we find the gradient of the dual variable u n by fixing the original variable j at the time: Since F * (p) � δ p (p) is an indicator function of a convex set, when solving p n+1 , formula (35) needs to be projected onto the unit sphere 1 (x) � x/ max(1, ‖x‖), so the final iterative update equation for dual variables is Similarly, by calculating the partial derivative of u in formula (32), the presolvent formula of the original variable u is obtained: In the formula, 〈p, ∇u〉 � 〈u, −∇ · p〉. By fixing the obtained p n+1 and performing gradient descent on the original variable, the iterative update equation of the original variable u can be further obtained: Finally, when the algorithm meets the preset iteration termination condition, the global optimal solution of the original variable u can be obtained. Taking the optimization problem of the ROF denoising model as an example, the update of the dual variables (formulas (31) and (32)) mainly includes the point-by-point gradient operation of the original variable and the elementwise projection operation of the variable. We assume that the solution of the energy function is two-dimensional. In the discrete case, the original variable can be expressed as e gradient operation of the original variable can be defined as Computational Intelligence and Neuroscience ∇u ≜ z zx 0 · · · 0 0 · · · 0 0 z zx · · · 0 0 · · · 0 · · · · · · · · · · · · · · · · · · · · · 0 · · · · · · · · · · · · · · · z zx z zx 0 · · · 0 0 · · · 0 · · · z zx · · · 0 0 · · · 0 · · · · · · · · · · · · · · · · · · · · · 0 · · · · · · · · · · · · · · · z zx Moreover, the partial derivatives z/zx and z/zy in each dimension can be realized by forward difference. is article combines digital feature recognition technology to construct a ballet training movement correction system. e technical imaging principle is shown in Figure 5 . When the laser transmitter does not emit a laser beam to the target, the laser speckle is formed by the scattering of the medium, and the laser speckle is projected on the target. In this process, laser speckle is a random process, and probability statistics are used to find the movement law and intensity distribution of the speckle. e process of the whole ballet training movement recognition method is shown in Figure 6 . e correction system in this paper mainly inputs the ballet training action recognition result into the system to compare with the standard action and judge the rationality of its action. erefore, the process of the ballet movement system proposed in this paper is shown in Figure 7 below. e system built in this paper can recognize the pictures and videos of ballet training and can also perform real-time action recognition and correction for ballet training. Figure 8 shows the effect of the system's ballet dance training action correction. On the basis of the above analysis, the model proposed in this paper is verified. First, the digital feature recognition technology proposed in this paper is evaluated, and the statistical test results are shown in Table 1 . On this basis, the correction effect of ballet training is analyzed, and the statistical test results are shown in Table 2 . In general, the experimental research results verify that the digital feature recognition technology proposed in this paper can play an important role in ballet movement recognition and has a good motion correction effect. Most of the students have not received systematic ballet dynamic technical movement training before entering the school, resulting in relatively weak students' basic skills and basic skills and relatively poor acceptance of learning new movements and new techniques. Ballet's dynamic technology has undergone constant casting and tempering by countless ancestors and constant innovations, and a very scientific and standardized training system has been formed. Moreover, it has accumulated a wealth of "clinical" experience in terms of training content and training methods. On this basis, this paper combines digital feature recognition technology to carry out ballet training movement correction analysis and combines the actual needs of actual ballet movement correction to conduct research and proposes a corresponding intelligent training correction system to improve the effect of ballet training. e experimental research results verify that the digital feature recognition technology proposed in this paper can play an important role in ballet movement recognition and has a good motion correction effect. e labeled dataset used to support the findings of this study is available from the corresponding author upon request. e author declares no competing interests. Portable 3D human pose estimation for human-human interaction using a chestmounted fisheye camera Algorithm based on one monocular video delivers highly valid and reliable gait parameters Pose estimate based yoga instructor A review of the evolution of vision-based motion analysis and the integration of advanced computer vision methods towards developing a markerless system DTCoach: your digital twin coach on the edge during COVID-19 and beyond Multiple human 3d pose estimation from multiview images Multi-person hierarchical 3d pose estimation in natural videos Multipath affinage stacked-hourglass networks for human pose estimation Judging the normativity of PAF based on TFN and NAN Human pose estimation in video via structured space learning and halfway temporal evaluation Action recognition using deep convolutional neural networks and compressed spatio-temporal pose encodings VNect Realtime multi-person 2D pose estimation Hierarchical contextual refinement networks for human pose estimation A multi-stage convolution machine with scaling and dilation for human pose estimation Deep probabilistic human pose estimation Metrabs: metric-scale truncation-robust heatmaps for absolute 3d human pose estimation Body part extraction and pose estimation method in rowing videos An evaluation of pose estimation in video of traditional martial arts presentation [Papers] keep your eye on the ball: detection of kicking motions in multi-view 4K soccer videos fast and accurate whole-body pose estimation in the wild and its applications is study was sponsored by Guangxi University of Arts.