This paper presents a comparative analysis of two parallel sorting algorithms, bitonic sort and parallel radix sort, implemented in OpenCL for GPU architectures. Both algorithms are benchmarked against quicksort on an NVIDIA GPU. Radix sort is generally the fastest, but quicksort outperforms both once data-transfer time is included in the overall execution time. The results highlight the importance of selecting a parallel sorting algorithm according to the needs of the application and the target GPU architecture.
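
For readers unfamiliar with bitonic sort, the following is a minimal sequential sketch of its compare-exchange network in Python. It is not the paper's OpenCL implementation; the function name `bitonic_sort` and the power-of-two length restriction are assumptions made for illustration. Each iteration of the inner `for` loop is independent of the others within a pass, which is roughly the work that a single GPU work-item could perform in a parallel version.

```python
def bitonic_sort(values):
    """Sort a list whose length is a power of two using a bitonic network.

    Illustrative sequential simulation only (hypothetical helper, not the
    paper's kernel): on a GPU, every index i in the inner loop could be
    handled by a separate work-item, with one kernel launch per (k, j) pass.
    """
    n = len(values)
    if n & (n - 1):
        raise ValueError("length must be a power of two")
    a = list(values)  # work on a copy so the input is left untouched

    k = 2  # size of the bitonic sequences being merged in this stage
    while k <= n:
        j = k // 2  # distance between the two elements being compared
        while j >= 1:
            for i in range(n):
                partner = i ^ j
                if partner > i:  # handle each pair exactly once
                    ascending = (i & k) == 0  # sort direction of this block
                    if ascending and a[i] > a[partner]:
                        a[i], a[partner] = a[partner], a[i]
                    elif not ascending and a[i] < a[partner]:
                        a[i], a[partner] = a[partner], a[i]
            j //= 2
        k *= 2
    return a
```

The network performs the same sequence of comparisons regardless of the input data, which is what makes it a natural fit for data-parallel hardware, at the cost of O(n log² n) comparisons in total.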