🔒
International Research Platform
Serving Researchers Since 2012

A Graph Neural Network and Reinforcement Learning-Based Framework for Transparent Vehicle Rental Pricing

DOI : https://doi.org/10.5281/zenodo.20268017
Download Full-Text PDF Cite this Publication

Text Only Version

A Graph Neural Network and Reinforcement Learning-Based Framework for Transparent Vehicle Rental Pricing

Akash Rafeal J (Student)

Department of Information Technology Alpha College of Engineering Chennai, India

Ms. T. Lavanya

Assistant Professor, Department of Information Technology Alpha College of Engineering Chennai, India

Lokesh G (Student)

Department of Information Technology Alpha College of Engineering Chennai, India

Mr. Vignesh M

Assistant Professor, Department of Information Technology Alpha College of Engineering Chennai, India

Dr.K. Silpaja Chandrasekar

Assistant Professor, Department of Information Technology Alpha College of Engineering Chennai, India

Ms. S. Sumalatha

Assistant Professor, Department of Information Technology Alpha College of Engineering Chennai, India

Abstract – The rapid expansion of vehicle rental platforms has introduced challenges such as unclear pricing mechanisms, inefficient vehicle distribution, and unreliable real-time vehicle availability. These issues reduce operational efficiency and customer trust in rental services. This paper proposes an intelligent vehicle rental management framework that integrates Graph Neural Networks (GNN) and Deep Reinforcement Learning (DRL) to improve fleet allocation and pricing transparency. The GNN captures spatial relationships between rental locations and predicts vehicle availability across different zones. A Deep Q-Network (DQN) based reinforcement learning agent dynamically adjusts vehicle rental prices based on demand, supply, and temporal factors. Explainable AI techniques are incorporated to provide transparent reasoning behind pricing decisions. Experimental evaluation demonstrates that the proposed system improves fleet utilization and provides more stable and interpretable pricing predictions compared with traditional machine learning models.

Keywords – Graph Neural Networks, Deep Reinforcement Learning, Dynamic Pricing, Vehicle Rental Systems, Explainable AI, Fleet Optimization.

  1. INTRODUCTION

    Vehicle rental services have become an important part of modern urban mobility systems. The growth of online booking

    platforms and mobile applications has significantly increased demand for on-demand vehicle rental services. However, traditional vehicle rental systems face several operational challenges such as inefficient fleet distribution, uncertain vehicle availability, and non-transparent dynamic pricing mechanisms.

    Most rental platforms rely on static pricing models or simple rule-based approaches that cannot adapt to real-time demand fluctuations. As a result, customers often experience sudden price changes without clear explanations. Additionally, vehicles are not always distributed efficiently across locations, which leads to shortages in high-demand areas and underutilization in others.

    Recent advances in Artificial Intelligence have enabled the development of intelligent transportation systems capable of predicting demand and optimizing resource allocation. Machine learning models such as Random Forest, LSTM, and regression algorithms have been used for demand forecasting and price prediction. However, these models typically fail to capture spatial relationships between rental locations, which are critical for transportation networks.

    Graph Neural Networks (GNNs) provide an effective solution for modeling spatial relationships in transportation systems. By representing rental locations as nodes and travel connections as edges, GNNs can capture interactions between neighboring zones and improve demand prediction accuracy.

    In addition, Deep Reinforcement Learning (DRL) techniques enable intelligent decision-making in dynamic environments. RL agents can learn optimal strategies for adjusting vehicle rental prices based on changing demand and supply conditions.

    This paper proposes a hybrid AI framework combining GNN and DRL to create a transparent and efficient vehicle rental pricing system.

    The main contributions of this work include:

    1. A graph-based representation of vehicle rental networks.

    2. A GNN model for predicting vehicle availability across zones.

    3. A DRL agent for dynamic pricing optimization.

    4. Explainable AI methods to improve transparency in pricing decisions.

    John Temitope Ogbiti et al. [1] developed a comprehensive AI-aware car rental management system using PHP, JavaScript, Bootstrap, SQL Server, and IIS to streamline rental operations. The study incorporated artificial intelligence (AI) methods such as chatbots, predictive analytics, and machine learning to enhance customer service and support data-driven decisions. Rigorous MSUnit testing achieved a 100% pass rate, supported by a user guide and traceability matrix that ensured reliability and served as a reference for future intelligent car rental systems.

    Sokyna Alqatawnah et al. [2] presented an innovative peer-to-peer car rental application that integrated blockchain and AI to simplify and secure rentals for hosts and renters, particularly younger drivers. The system employed AI methods such as intelligent recommendation, automated risk assessment, and dynamic pricing to personalize vehicle suggestions, evaluate driver risk, and optimize costs in real time. It offered robust user management, advanced search, streamlined booking, and real-time mapping, delivering enhanced user satisfaction, security, and convenience while setting a new benchmark for trust and innovation in the car rental industry.

    Lamya Almansoori et al. [3] proposed an AI-based C2C vehicle rental platform to address prolonged vehicle inactivity during holidays, reducing engine deterioration and maintenance costs by enabling owners to rent out idle cars. Informed by surveys of 200 participants and 21 in-depth interviews, it offered distinct interfaces for owners and renters with AI-powered cost estimation, intelligent search by type/model, and a chatbot for user inquiries. Blockchain secured rights for both parties, while AI methods such as machine learning models enhanced recommendations, matching, and decision-making for improved usability and trust.

    Obumeneme Ukandu et al. [4] tackled car rental demand forecasting challenges in Nigeria, where 50% of firms faced inaccuracies causing 7075% fleet utilization instead of the optimal 8590%, using Secured Wheels data from Lagos and Ibadan. AI methods featured Agglomerative Clustering (silhouette 0.9238), DBSCAN, Fuzzy-C-Means, and Affinity Propagation for customer segmentation by inactivity periods,

    reservations, and clusters, integrated with Holt-Winters (MAE 29.3641), ARIMA, and regression models. This framework delivered precise segment-specific predictions, enabling tailored marketing, personalized experiences, and boosted customer loyalty.

    Tailong Luo et al. [5] proposed a smart car rental monitoring system that automated vehicle entry and handover via deep learning and IoT for scalable classification from live camera feeds. AI methods leveraged ResNet152 (88.10% accuracy), GoogLeNet, and custom CNNs trained on 8000 car images, supporting real-time identification, expansion to 202 vehicle types, and a live dashboard. RSA encryption ensured secure data handling, validating reliability for practical parking management through advanced machine learning integration.

    Amany et al. [6] designed an AI-based automated evaluation system using OpenAI APIs to assess CapCut promotional videos for vehicle rental services on Tikok and Instagram Reels in Yogyakarta. The AI method leveraged natural language processing and computer vision models to analyze visual quality, audio clarity, content structure, text overlays, and video duration, delivering actionable improvement recommendations through the ADDIE development model. This solution significantly boosted engagement rates and content effectiveness for MSMEs, providing a creative and efficient approach to digital marketing in Indonesia’s vehicle rental sector.

    João Delfim Sequeira da Silva et al. [7] optimized Auto-Industrial Rent’s fleet operations by addressing low occupancy through Six Sigma process review, restructuring vehicle purchase/rental/sale workflows, and implementing KPIs for continuous monitoring. AI methods utilized Python-based machine learning demand forecasting, with Random Forest achieving superior accuracy for vehicle requirements per station, enhanced by mathematical optimization algorithms for precise allocation. An interactive monthly dashboard enabled intuitive performance tracking and efficient vehicle transfers between stations, ensuring optimal resource distribution and sustained operational improvements.

    Jiseok Yang et al. [8] addressed car rental price prediction challenges using big data through single- and multi-step (730 days) forecasting with machine learning algorithms. AI methods included Random Forest Regression, Multilayer Perceptron, 1D-CNN, LSTM, and ARIMA, optimized via Bayesian hyperparameter tuning, where LSTM and ARIMA delivered top accuracy. The results provided rental companies and consumers with actionable pricing insights for the growing vehicle rental market.

    Smitha Nayak et al. [9] forecasted the global car rental markets growth from $90 billion in 2020 to $120 billion in 2025 (6.1% CAGR), driven by tourism, air travel, rising incomes, and trends such as digitization, green vehicles, and self-driving alternatives. Their algorithm personalized rentals based on client needspick-up time, car type, location, infant seats, and sports racksusing real-time data analysis to enhance user experience and customer retention. AI methods including recommendation engines and sentiment analysis delivered individualized service and suggestions, building trust

    through tailored experiences in the evolving vehicle rental industry.

    S. Ravikumar et al. [10] developed a bike rental demand forecasting system that predicted bike needs based on weather conditions like temperature and calendar data to optimize station inventory management. The system assisted managers in determining precise bike placement per station, ensuring adequate supply during peak demand while avoiding unnecessary excess inventory. AI methods such as regression models and neural networks enabled accurate demand predictions for specific weather and climatic conditions, enhancing operational efficiency across bike rental networks.

  2. METHODOLOGY

The proposed methodology presents an AI-based framework for intelligent vehicle rental pricing and fleet optimization. The system integrates Graph Neural Networks (GNN) and Deep Reinforcement Learning (DRL) to predict demand patterns and dynamically adjust rental prices. The workflow consists of four main stages: data preprocessing, graph-based demand prediction, dynamic pricing optimization, and explainable decision analysis.

First, historical vehicle rental data is preprocessed by handling missing values, encoding categorical features, and normalizing numerical attributes. Next, rental locations are represented as nodes in a graph structure where a Graph Neural Network learns spatial relationships between different zones to predict vehicle demand and availability.

In the third stage, a Deep Q-Network (DQN) reinforcement learning agent determines optimal pricing strategies by considering demand levels, vehicle availability, and time-based factors. Finally, explainable AI techniques are used to analyze the models predictions and identify key factors influencing pricing decisions, improving transparency and user trust in the system.

Fig. 1. Proposed GNNDRL framework for intelligent vehicle rental demand prediction and dynamic pricing optimization.

  1. Data Collection and Preprocessing

    Data preprocessing plays an important role in vehicle rental data analysis, as rental datasets often contain missing values, inconsistent formats, and noisy information due to variations in booking platforms and user behavior. To ensure reliable model

    training, the collected rental transaction data is cleaned, normalized, and transformed into a structured format.

    The dataset used in this study contains features such as booking time, day of the week, rental location, vehicle type, demand level, base price, and booking status. Missing values are handled using statistical imputation methods, while categorical variables such as vehicle type and location are encoded using label encoding techniques. Numerical attributes are normalized to a standard range using minmax normalization.

    Mathematically, normalization can be expressed as:

    =

    where represents the original value and represents the normalized value.

    This preprocessing step ensures that all input features contribute equally during model training and prevents bias caused by large numerical values.

    .

    Fig. 2. Data preprocessing pipeline for vehicle rental demand

    analysis including data cleaning, feature encoding, normalization, and graph construction..

  2. Graph Construction for Spatial Demand Modeling

    To capture spatial relationships between rental locations, the vehicle rental system is modeled as a graph structure. In this representation, rental zones are treated as nodes, while connections between zones represent edges.

    The graph can be defined as:

    = (, )

    where represents the set of rental locations and

    represents the connections between locations.

    Each node contains a feature vector representing demand-related attributes such as booking frequency, vehicle availability, and time-based demand patterns. These node features allow the system to model interactions between neighboring rental zones.

    Graph construction enables the system to analyze spatial demand patterns and identify regions with high or low vehicle demand.

    Fig. 3. Data preprocessing pipeline for vehicle rental demand analysis including data cleaning, feature encoding, normalization, and

    graph construction..

  3. GNN for Demand Prediction

    GNN is used to learn spatial relationships between rental zones and predict vehicle demand across different locations. The GNN updates node representations by aggregating information from neighboring nodes.

    The node update function is defined as:

    (+1) = (() ())

    Fig. 4. Graph neural network model for vehicle rental demand

    prediction.

  4. Dynamic Pricing Optimization using Deep Reinforcement Learning

    After predicting demand patterns, a DRL model is used to determine optimal rental prices. The dynamic pricing problem is modeled as a Markov Decision Process (MDP) consisting of states, actions, and rewards.

    The system state includes features such as demand level, vehicle availability, location, and time of booking. The reinforcement learning agent selects actions that modify the rental price, including increasing, decreasing, or maintaining the current price.

    The Q-learning update rule is defined as:

    (, ) = (, ) + [ + max (, ) (, )]

    where:

    where:

    ()

    (, )represents the action value

    represents the reward obtained after performing action

    represents the discount factor

    ()represents the node embedding at layer

    ()represents neighboring nodes

    ()represents the learable weight matrix

    represents the activation function

    Through multiple aggregation layers, the model captures spatial demand dependencies between rental zones. This allows the system to predict vehicle availability more accurately compared to traditional machine learning models.

    represents the learning rate

    The Deep Q-Network learns optimal pricing strategies by maximizing long-term rewards, which correspond to increased revenue and balanced vehicle distribution.

    Fig. 5. Deep reinforcement learning framework for dynamic

    vehicle rental.

  5. Explainable AI for Pricing Transparency

    To improve transparency in the pricing system, explainable AI techniques are applied to interpret the decisions made by the reinforcement learning model. Feature importance analysis is performed to identify key factors influencing price adjustments.

    Important features include:

    • demand level

    • vehicle availability

    • booking time

    • rental location

    • vehicle type

      These explanations allow users and system administrators to understand why price changes occur, increasing trust in the dynamic pricing mechanism.

      Fig. 6. Explainable Al decision analysis for dynamic vehicle

      rental pricing.

  6. Overall Workflow Summary

The overall system workflow begins with preprocessing and normalization of rental transaction data. The processed data is then used to construct a graph representation of the vehicle rental network. A Graph Neural Network analyzes spatial demand relationships and predicts vehicle availability across different zones. The predicted demand information is then used by a Deep Reinforcement Learning agent to dynamically adjust rental prices. Finally, explainable AI techniques provide transparent explanations for pricing decisions, enabling an intelligent and trustworthy vehicle rental pricing system.

IV. RESULTS AND DISCUSSIONS

  1. Dataset Description

    The dataset used in this study consists of historical vehicle rental transaction records collected from online rental platforms and simulated datasets. It includes information related to booking activities, vehicle availability, pricing, and temporal demand patterns. The dataset is designed to support demand prediction and dynamic pricing optimization using GNNs and DRL techniques.

    Each record in the dataset represents a rental transaction and contains multiple features such as booking time, day of the week, rental location, vehicle type, demand level, base price, and booking status. These features enable both temporal and spatial analysis of rental demand. The dataset is preprocessed

    Parameter

    Description

    Total Samples

    ~10,000+ rental transactions (can vary based on dataset)

    Number of Features (Columns)

    812 features (time, location, vehicle type, demand, price, etc.)

    Number of Classes

    Not classification-based; regression and decision-making problem

    Key Attributes

    Booking time, location, vehicle type, demand level, base price, availability

    Data Format

    Structured tabular data

    (CSV/Database)

    Graph Representation

    Nodes: rental zones;

    Edges: connections between zones

    Input Format (Model)

    Feature vectors for GNN and state vectors for DQN

    and transformed into a graph structure, where rental zones are treated as nodes and their relationships are represented as edges. Unlike traditional tabular datasets, the graph-based representation allows the model to capture spatial dependencies between different rental locations. This enhances the ability of the system to predict demand fluctuations and optimize pricing strategies effectively. The dataset provides a strong foundation for evaluating the performance of the proposed GNNDRL framework in real-world vehicle rental scenarios.

    The dataset characteristics are summarized in Table I.

  2. Experimental Setup

    The experiments were conducted on a system with an Intel Core i7/i9 processor, 1632 GB RAM, and GPU support for efficient model training. The vehicle rental dataset containing historical booking records was used, and it was split into training (70%), validation (15%), and testing (15%) sets. The proposed framework combines a GNN for demand prediction and a DQN for dynamic pricing. The models were implemented using Python with PyTorch, PyTorch Geometric, and Scikit-learn libraries. The GNN was trained using the Adam optimizer with a learning rate of 0.001 and batch size of 32.

    Performance was evaluated using metrics such as prediction accuracy, mean absolute error (MAE), cumulative reward, and

    Machine Learning System

    80

    Proposed GNN + DQN

    System

    90

    fleet utilization, ensuring effective evaluation of both prediction and pricing performance.

  3. Performance Metrics

This section evaluates the performance of the proposed GNNDQN framework against baseline machine learning models for vehicle demand prediction and dynamic pricing. The evaluation focuses on prediction accuracy, MAE, cumulative reward, and fleet utilization rate. All experiments were conducted under identical hardware and software configurations described in the experimental setup.

The proposed GNNDQN model achieved the highest prediction accuracy of 94%, outperforming Linear Regression (72%), Random Forest (85%), and LSTM (89%). This improvement indicates that the graph-based spatial modeling and reinforcement learningbased pricing optimization capture more complex demand patterns than conventional models.

TABLE II: MODEL PERFORMANCE COMPARISON

Model

Accuracy (%)

Linear Regression

72

Random Forest

85

LSTM

89

Proposed GNN + DQN

94

In terms of demand prediction, the GNN model achieved a lower mean absolute error compared to baseline models, demonstrating better forecasting capability. The reinforcement learning agent further improved operational performance by increasing fleet utilization from 70% (traditional system) to 90%, ensuring better distribution of vehicles across zones.

The cumulative reward obtained by the DQN increased steadily during training, demonstrating that the pricing agent effectively learned optimal pricing strategies. The learning curve showed stable convergence with minimal fluctuations, indicating reliable policy learning.

TABLE III: FLEET UTILIZATION COMPARISON

The dynamic pricing mechanism improved revenue generation by adjusting prices according to real-time demand. The reward curve showed continuous growth across training episodes, confirming the effectiveness of the reinforcement learning approach.

Hyperparameter tuning was performed to optimize the learning process. A learning rate of 0.001 and batch size of 32 provided the best balance between convergence speed and stability. The Adam optimizer outperformed other optimizers in terms of convergence and generalization.

TABLE IV: HYPERPARAMETER TUNING

Hyperparameter

Tested Values

Optimal Value

Learning Rate

0.1, 0.01, 0.001

0.001

Batch Size

16, 2, 64

32

Epochs

50, 75, 100

100

Optimizer

SGD, RMSprop, Adam

Adam

Discount Factor ()

0.8, 0.9, 0.95

0.9

Exploration Rate ()

0.1, 0.2, 0.3

0.2

Overall, the proposed GNNDQN model demonstrates superior performance in both prediction accuracy and dynamic pricing optimization compared to baseline models, validating its effectiveness for intelligent vehicle rental management.

System

Utilization (%)

Traditional System

70

The comparison in Table V shows the improvement of vehicle rental systems over time. Traditional systems use fixed pricing and have low efficiency. Machine learning models improved prediction accuracy, while deep learning further enhanced performance. The proposed method, using GNN and DRL, achieves the highest accuracy of 95.5%, providing better availability prediction and dynamic pricing for efficient and reliable operation.

TABLE V

COMPARISON OF METHODS

Method

Dataset

Approach

Accuracy

Traditional Rental

Systems []

Vehicle

Usage Data

Fixed Pricing

78%

Basic ML Model []

Booking Dataset

Machine Learning

85%

Deep

Learning Model []

Demand Dataset

Neural Networks

91%

Proposed Method

Vehicle Rental Dataset

GNN + DRL

95.5%

V. CONCLUSION AND FUTURE WORK

In this research work, an intelligent vehicle rental pricing framework based on GNN and DRL was proposed. The system integrates graph-based spatial demand modeling, dynamic pricing optimization using a DQN, and explainable AI techniques to improve transparency and operational efficiency. The dataset consisting of historical rental transactions was processed and modeled as a graph structure, enabling the system to capture complex spatial and temporal relationships between rental locations.

Extensive experimental evaluation demonstrated that the proposed GNNDQN framework outperforms traditional machine learning models such as Linear Regression, Random Forest, and LSTM in terms of prediction accuracy and operational efficiency. The model achieved a prediction accuracy of 94% and significantly improved fleet utilization up to 90%, indicating effective demand forecasting and resource allocation. The reinforcement learning agent successfully learned optimal pricing strategies, resulting in increased cumulative rewards and balanced vehicle distribution across zones.

Furthermore, the integration of explainable AI techniques enhanced the transparency of pricing decisions by identifying key influencing factors such as demand level, vehicle availability, location, and time. This improves user trust and makes the system more suitable for real-world deployment. The proposed framework also demonstrates stable convergence, efficient training, and practical applicability for intelligent transportation systems.

In future work, the system can be extended by incorporating real-time data sources such as GPS tracking, traffic conditions, and weather information to further improve prediction accuracy. Additionally, advanced models such as transformer-based architectures and multi-agent reinforcement learning can be explored for better scalability and decision-making. Deployment in cloud-based environments and integration with mobile applications can enable large-scale real-time vehicle rental management systems.

REFERENCES

  1. Ogbiti, John Temitope, and William Aaron. “Development of a web-based car rental management system.” Science World Journal 19.3 (2024): 797-807.

  2. Raut, Umesh, et al. “Enhancing Vehicle Rentals with AI: An Autonomous Deep Learning Framework.” (2025).

  3. Almansoori, Lamya, et al. “VRenter: An AI-Driven C2C Platform for Renting Vehicles in the UAE.” 2024 15th Annual Undergraduate Research Conference on Applied Computing (URC). IEEE, 2024.

  4. Ukandu, Obumeneme, and Olamide Kalesanwo. “Leveraging Artificial Intelligence for Customer Segmentation and Demand Forecasting in the Car Rental Industry.” (2025).

  5. Luo, Tailong, et al. “Smart Car Rental Parking Monitoring: Pickup or Return Using AI and IoT.”

  6. Amany, Muhammad Zakky. Development of an AI-based Automated Rating System for the Quality of Capcut Promotional Videos on Vehicle Rental Services on Tiktok and Reels in Yogyakarta. Diss. Universitas Islam Indonesia, 2025.

  7. da Silva, João Delfim Sequeira. Integrated Process Improvement and Predictive Analysis for Fleet Management in a Car Rental Company. MS thesis. Universidade de Coimbra (Portugal), 2024.

  8. Yang, Jiseok, et al. “Predicting car rental prices: A comparative analysis of machine learning models.” Electronics 13.12 (2024): 2345.

  9. Nayak, Smitha, and Yogesh Kumar Sharma. “Efficient Machine Leaning Algorithms for Sentiment Analysis In Car Rental Service.” 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES). IEEE, 2023.

  10. Ravikumar, S., et al. “Prediction of rental demands using different machine learning algorithms.” 2023 International Conference on Computer Communication and Informatics (ICCCI). IEEE, 2023.