Аннотация:We introduce the detailed comparison of meta-heuristic and reinforcement learning algorithms implementation in the area of quan-tum computations on the example of effective optimization of quantum control scheme to produce quantum logic gates with maximum fidelity to their theoretical counterpart. In particular, we compare the decision mak-ing process of the Genetic Algorithm (GA) as a meta-heuristic algorithm and Proximal Policy Optimization (PPO) as a reinforcement learning algorithm. We provide the comparison via the t-SNE and UMAP dimen-sionality reduction algorithms and analyze solution exploration process for each algorithm based on the reduced representation of generated solu-tions during training. Besides, a detailed review of the investigated prob-lems and the algorithms used in the paper is given.