DOI : 10.17577/IJERTV15IS043579
- Open Access
- Authors : Soysa H. R. T., W.S.A. De Silva, Peiris W. V. D., Gamage P. U.
- Paper ID : IJERTV15IS043579
- Volume & Issue : Volume 15, Issue 04 , April – 2026
- Published (First Online): 05-05-2026
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License:
This work is licensed under a Creative Commons Attribution 4.0 International License
An Intelligent Market Price Forecasting and Sustainability Assessment for Anthurium Floriculture: A Data-Driven Decision Support Approach
Soysa H. R. T., W.S.A. De Silva
Sri Lanka Institute of Information Technology (SLIIT), Malabe, Sri Lanka
Peiris W. V. D., Gamage P. U.
Sri Lanka Institute of Information Technology (SLIIT), Malabe, Sri Lanka
AbstractThe markets dealing with perishable ornamental crops such as Anthurium flowers show dynamic pricing changes on a daily basis, and these changes cannot easily be explained by existing theories. Floriculturists engaged in such markets, especially at smaller and medium scale levels, are continuously confronted by the need to make selling decisions without any quantitative method by which they can evaluate if the current or expected prices represent a truly good opportunity to sell. This paper proposes a system for market intelligence designed to solve this problem. The main component of this system is a price prediction engine built with a sequential deep learning model and ensemble regression layers using historical price data collected for eight consecutive days for each shop, variety and size combination to predict the daily prices. The proposed system is based on a market scoring index known as SAMIS, which will provide market assessments after predicting prices according to its statistical price sustainability, momentum and price risks. SAMIS results are evaluated as a composite score between zero and hundred and are further classified according to a four-category classification of recommendations for different market situations ranging from ideal selling opportunities to periods where deferral of transactions is suggested. Evaluations of the system were done using real transactional data available in Sri Lankan Anthurium markets and showed stability in forecasting prices along with proper market assessment results. To our best knowledge, this study is the first to introduce a systematic approach for assessing the sustainability of ornamental flower markets.
Index Terms-Anthurium, floriculture market analytics, price forecasting, market sustainability, decision support, SAMIS, Sri Lanka, agricultural informatics, time-series prediction
-
INTRODUCTION
In the world of tropical cut floriculture, Anthurium has become one of the more commercially difficult-to-manage crops. Careful attention must be paid to growing conditions for the flowers, as well as their relatively high market value and extended vase life. This type of flowering plant is cultivated on a limited scale in the Gampaha, Matale, and Kalutara districts of Sri Lanka. In this area, the climate, soil, and social conditions enable growers to earn considerable incomes and have contributed to the development of a modestly successful business sector. As in any commercial operation, success is determined not just by the quality and amount of products produced, but also by day-to-day processes at local flower markets.
However, one of the main problems encountered in such businesses is the lack of structured information concerning current market prices. This is because fluctuations may occur every day, based on factors such as festival demand increase and temporary oversupply of flowers from other provinces. Usually, local farmers try to find out about prices through calling intermediaries on a daily basis or visiting local market collection centers during sales. Unfortunately, in most cases, harvesting and selling flowers take place without consideration of current price levels. This leads to losses and low profitability due to a failure to properly match the supply to the demand.
Although a lot of research has been done in recent years, the field of price prediction for agrarian products is still far from being well-understood and implemented effectively in ornamental floriculture, especially concerning Anthurium flowers. This is because the majority of models created thus far were used to forecast staple crop prices, where rich histories, futures contracts, and accessible data helped develop effective methods. However, since ornamental flower markets are localized and scattered, there is little reliable information about the price levels available in the open source literature. Studies that have looked at such markets found that while a single model approach failed to describe the dynamics of prices, ensembles did better.
Apart from predicting price accurately, there is an even larger hole that this paper intends to plug. While a price prediction model can give the most precise forecast, it cannot tell the farmer whether or not the predicted price signifies a favorable period for sale. For instance, while a highly accurate predicted price is high when compared to yesterday's price, it may actually be an outlier in relation to the price history of the crop in question due to sudden increase in demand. In contrast, a less predicted price in a stable market environment is more reliable than a higher price in a highly dynamic market environment. However, there is no distinction between such scenarios in most price forecasting models used by farmers and growers, thus creating a need for a solution to be sought in this research.
This paper will make two major contributions. Firstly, it will develop a price prediction model based on a sequential fusion framework that leverages structured historical price sequences to produce daily Anthurium price predictions. Secondly, the more significant contribution of this paper is the development of SAMIS (market sustainability scoring index), which measures the predicted price on three independent statistical dimensions and classifies the predicted market into one of the four action categories. The structure of the rest of the paper is as follows: related works in Section 2; methodology in Section 3; experimental results in Section 4; and discussion and conclusion in Sections 5 and 6 respectively.
-
RELATED WORK
-
Agricultural Price Forecasting
Early attempts at price prediction relied mostly on statistics. The main approach was represented by ARIMA models and seasonally adjusted versions of ARIMA models until well into the 2000s. Although these approaches are still used, they are known to struggle with structural volatility, particularly in non-stationary data sets. Essentially, the problem lies in the fact that ARIMA-family models require linearity and stationarity both of which rarely occur in the spot price market for perishable goods with demand shocks, harvest losses, and periodic cycles of supply.
The next logical step would be machine learning algorithms. In fact, studies using Support Vector Regression, Gradient Boosting trees, and Random Forest Regression models proved that the machine learning approach can surpass linear models and show better results in predictions for prices of vegetables, grains, and fruits. However, this approach relies on cross-sectional regression analysis; hence, any kind of sequencing, like weekly cycles of demand and slow price changes over weeks, needs to be accounted for with manual introduction of lagged data.
Finally, deep learning algorithms came up with the solution for the sequencing problems with recurrent neural networks. Long Short-Term Memory models showed a great potential in predicting time series and, unlike earlier algorithms, required no additional manual work in creating lagged features. Multiple studies analyzing vegetable and grain prices found that sequence models could outperform classical models on the data set large enough and stable enugh. Recently, even more sophisticated methods that involved training ensembles on outputs from sequence models have proven successful due to the combination of advantages: while the former captures complex temporal dynamics impossible to infer using the features themselves, the latter allows accounting for changes in distributions and non-temporal factors.
-
Floriculture and Ornamental Crop Markets
Floriculture markets have not received sufficient research attention. Several investigations into cut flower price prediction using machine learning and hybrid models have been undertaken in the Dutch and Kenyan export market environments where flower price information is easily
available. Such research has largely demonstrated the use of machine learning and hybrid models in flower pricing problems, but such studies apply to wholesale export markets that do not correspond with the smaller retail and semi-wholesale markets common in the case of the Sri Lankan Anthurium flower business. Research studies focused on Anthurium price prediction are virtually non-existent.
-
Market Sustainability and Risk Assessment
The concept of market sustainability in the context of agricultural price forecasting has received almost no formal treatment in the literature. Existing works that touch on market risk typically employ simple volatility measures standard deviation of historical prices, or price range ratios as contextual metadata rather than as components of a structured decision framework. The Coefficient of Variation, defined as the ratio of standard deviation to mean price, has appeared in commodity market analysis as a normalized risk indicator, but its integration into scoring mechanisms for producer decision support has not been systematically explored.
Statistical approaches to price stability assessment, particularly those grounded in the Empirical Rule and z-score interval logic, have been applied in financial risk management but have not been translated into agricultural market scoring frameworks. The SAMIS index proposed in this paper draws on these statistical foundations and organizes them into a coherent, decomposable scoring structure designed specifically for the operational constraints and decision contexts of small-scale floriculture producers.
-
Gap Analysis and Positioning of This Work
Table 1 summarizes the positioning of this work relative to representative studies in the broader literature. The comparison criteria reflect the dimensions along which the present system claims novelty: hybrid model architecture, floriculture domain applicability, and the presence of a market sustainability assessment mechanism beyond raw price prediction.
Study
Crop
Hyb
Seque
Flo
Sust
Sell
/
/
rid
nce
ric
aina
Rec
Syste
Doma
Mod
Mode
ult
bilit
om
m
in
el
ling
ure
y
me
Spe
Scor
nda
cifi
ing
tio
c
n
ARIM
Gener
No
Partial
No
No
No
A-
al
based
crops
syste
ms [3]
SVR /
Veget
Parti
No
No
No
No
RF
ables,
al
appro
grains
aches
[4,5] LSTM
Grain,
No
Yes
No
No
No
standa
comm
lone
odity
[6] LSTM
Com
Yes
Yes
No
No
No
modit
XGBo
y
ost
marke
fusion
t
[7] Cut
Expor
No
Partial
Yes
No
/td>
No
flower
t
deman
floric
d
ulture
studie
s [8]
CV-
Com
No
No
No
Parti
No
based
modit
al
risk
y
tools
marke
[9] ts
Propo
Anth
Yes
Yes
Yes
Yes
Yes
sed
uriu
(SA
(4-
Syste
m, Sri
MIS
clas
m
Lank
)
s)
(This
a
Work
)
shopvarietysize combination. This eight-day window was selected as a practical balance between temporal depth and data availability, since some product combinations have limited trading frequency. When fewer than eight historical records are available prior to the target date, the earliest available price is repeated forward to fill the remaining positions. This edge-padding approach preserves sequence shape without introducing artificial directional bias.
For each position i within the eight-day window, a five-dimensional feature vector is constructed from the price sequence. The features are:
Table 1. Comparative analysis of related systems and positioning of the proposed work.
-
-
ETHODOLOGY
The system operates in two sequential stages. In the first stage, a predicted market price is generated for a given combination of date, shop, flower variety, and size. In the second stage, that predicted price is evaluated against the statistical properties of the historical price record for the same product combination, producing a SAMIS score and a four-class market recommendation. Each stage is described below in full detail.
-
Data and Feature Preparation
-
Dataset
The dataset comprises daily Anthurium transaction records collected from retail shops in Sri Lanka. Each record captures the transaction date, the shop identifier, the flower variety, the size grade, the seller price per stem in Sri Lankan Rupees (LKR), and the number of units sold that day. Prior to model training, categorical fields were standardized to lowercase and stripped of whitespace. Date fields were parsed using day-first formatting, and records with missing price or date entries were excluded. The resulting dataset provides the historical price trajectories used for both model training and SAMIS computation.
-
Lookback Window and Sequence Construction
Price prediction at a given target date is conditioned on the eight most recent trading days of price history for the same
Featur
e
Symbol
Description
Current price
p
The actual recorded seller price at position i
Lag-1
p(i-1)
Price from the previous day
captures immediate momentum
Lag-2
p(i-2)
Price from two days prior
captures short-term trend direction
Lag-4
p(i-4)
Price from four days prior
captures mid-week cycle effects
EMA-4
EMA4(p[0
:i])
Exponential moving average with span 4, computed up to position i represents smoothed trend
Table 2. Five-dimensional feature vector constructed at each position of the price sequence.
The Exponential Moving Average with span s is computed recursively. Given a smoothing factor = 2 / (s + 1), for span 4 this gives = 0.4:
EMA_t = 路 p_t + (1 ) 路 EMA_{t-1}
The higher weighting of recent observations under = 0.4 means the EMA responds quickly to genuine trend changes while attenuating isolated daily fluctuations. Including EMA alongside raw lag values gives the sequence model access to both the actual historical price at each step and a noise-reduced representation of recent price direction. Together, these five features form the input to the sequential learning component of the prediction engine.
-
-
Price Prediction Architecture
-
Sequential Feature Extraction
The eight-day, five-feature input tensor is first processed by a Long Short-Term Memory network trained to learn temporal regularities in Anthurium price sequences. LSTM networks are well suited to this task because of their gating mechanism, which allows the model to selectively retain or discard information as it progresses through the time series. In markets where weekly demand cycles, gradual seasonal drift, and occasional structural breaks are all present, this selective memory is particularly useful.
At each time step t, the LSTM cell updates its internal state using three gates. The forget gate determines what proportion of the previous cell state to retain, the input gate controls how
much of the new candidate information to incorporate, and the output gate determines what portion of the updated cell state is exposed as the hidden state. These computations proceed as follows:
f_t = (W_f 路 [h_{t-1}, x_t] + b_f) i_t = (W_i 路 [h_{t-1}, x_t] + b_i) o_t = (W_o 路 [h_{t-1}, x_t] + b_o)
c_t = tanh(W_c 路 [h_{t-1}, x_t] + b_c) c_t = f_t c_{t-1} + i_t c_t
h_t = o_t tanh(c_t)
The final hidden state h_8, produced after processing all eight time steps, is passed through a linear output layer to generate a preliminary price estimate referred to throughout this paper as lstm_pred. This value represents the network's learned expectation of the next price given the observed historical sequence.
-
Ensemble Fusion for Final Prediction
While the sequential model captures temporal price dynamics effectively, it operates on a fixed feature structure and cannot easily incorporate the kind of categorical context
shop identity, flower ariety, size grade that meaningfully influences Anthurium price formation in local markets. Different shops serve different buyer demographics and have different pricing power. Different varieties command structurally different price levels. A model that ignores this context will exhibit systematic bias across product combinations.
To address this, the lstm_pred value is assembled into a richer feature vector that includes contextual market attributes alongside statistical features derived from the same price window:
x_RF = [shop, variety, size, lstm_pred, units_sold, lag_1, lag_2, lag_4, EMA_4]
This vector is passed through a pre-trained preprocessing pipeline that applies ordinal encoding to categorical fields and scaling to numerical features, then fed into a Random Forest Regression model. The Random Forest generates the final price prediction by averaging across an ensemble of B independently grown decision trees:
p _final = (1/B) 路 _{b=1}^{B} T_b(x_RF)
The ensemble averaging suppresses variance in the final prediction a property that matters considerably in markets where individual transaction outliers can distort a single tree's output. The final predicted price is rounded to the nearest 0.5 LKR increment, reflecting the half-rupee pricing conventions observed in Sri Lankan Anthurium retail markets. This rounding step is implemented as:
p _rounded = round(p _final 脳 2) / 2
The overall prediction pipeline can be summarized as a sequential operation in which the temporal model extracts a price representation from the historical sequence, and the ensemble model refines that representation using contextual market information. Neither stage is redundant: removing the LSTM stage leaves the Random Forest without access to temporal pattern information, while removing the Random Forest stage leaves the prediction without contextual grounding.
-
-
SAMIS: A Market Sustainability Scoring Framework
The predicted price p is a point estimate. On its own, it tells a grower what the market is likely to do but says nothing about whether that expectation should change their behavior. A high predicted price in a chronically unstable market carries different implications than the same predicted price in a historically reliable one. A rising price against a backdrop of unusual volatility warrants more caution than the same price increase in a market with consistent trading patterns. These distinctions are the motivation for SAMIS.
SAMIS Sustainability, Analysis, and Market Intelligence Score is a composite index defined on the interval [0, 100]. It decomposes market assessment into three components, each capturing a distinct and orthogonal dimension of market quality: S, measuring how statistically consistent the predicted price is with historical behavior; T, measuring the direction and magnitude of recent price momentum; and R, measuring the underlying volatility of the market as reflected in its historical price dispersion. The composite score is:
SAMIS = S(p , , ) + T(p , p_recent) + R(, )
Each component is computed independently, so that a weakness on one dimension for example, a falling price trend does not artificially inflate or deflate the other components. This independence is an intentional design property: it allows the recommendation engine to respond to the full pattern of component values, not just the composite total.
-
Statistical Foundations
Before defining the components, it is useful to establish the statistical measures on which they depend. Given a historical price series {p_1, p_2, …, p_n} for a particular shopvarietysize combination, three summary statistics are computed.
The mean price represents the central tendency of the
historical market:
= (1/n) 路 _{i=1}^{n} p_i
The standard deviation measures the average magnitude
of price deviations from the mean:
= [ (1/n) 路 _{i=1}^{n} (p_i )虏 ]
The Coefficient of Variation CV expresses standard deviation as a proportion of the mean, producing a scale-independent measure of relative price dispersion:
CV = /
CV is preferred over raw standard deviation for cross-product comparisons because it normalizes volatility by the price level. A market trading at a mean of 200 LKR with = 16 LKR has identical relative volatility to one trading at 100 LKR with = 8 LKR, and CV correctly assigns them the same risk score. Using directly would make the higher-priced variety appear inherently riskier, which would be misleading.
-
Sustainability Component (S)
The Sustainability Component measures the degree to which the predicted price is consistent with the historical distribution of prices for the same product combination. A predicted price that falls well within the normal range of historical variation is statistically plausible it represents a continuation of established market behavior. A predicted price that lies far outside that range, whether unusually high or unusually low, suggests an atypical market condition that may not persist.
Condition
Scor e
Classificatio n
Interpretatio n
p +
40
Highly Sustainable
Predicted price within
the ce tral
68% of
historical distribution
1.5 p + 1.5
30
Sustainable
Moderate deviation from mean; still within
expected range
2 p + 2
20
Moderately Sustainable
Price approaching statistical boundary; unusual but not extreme
p [ 2, + 2]
10
Not Sustainable
Price is a statistical outlier; likely reflects
This assessment is formalized using z-score interval logic, which is grounded in the Empirical Rule for normally distributed data: approximately 68% of historical prices lie within one standard deviation of the mean, roughly 95% within two standard deviations. Predicted prices falling within successively wider intervals therefore represent successively less typical market conditions. The scoring structure is as follows:
transient market anomaly
Table 3. Sustainability Component scoring based on z-score interval analysis.
An important property of this formulation is that it responds symmetrically to outliers in both directions. A predicted price that is unusually high perhaps reflecting a temporary demand surge receives the same low sustainability score as one that is unusually low. From the perspective of market reliability, both extremes represent unstable conditions that may not support a dependable selling decision. The grower is informed not only about the expected price, but about how representative that price is of the market's normal operating range.
The score ceiling of 40 for this component, relative to 30 for each of the other two, reflects the interpretive emphasis placed on statistical price normality in the overall framework. A market whose predicted price falls squarely within historical norms is fundamentally more reliable as a selling environment than one characterized by high momentum or low volatility alone.
-
Trend Component (T)
The Trend Component captures the direction and strength of recent price movement. Rather than evaluating the predictd price against the full historical distribution as the Sustainability Component does this component focuses on the short-term trajectory: where has the price been going over the past week, and does the predicted price continue that direction?
Let p_recent denote the first price in the trailing seven-day window. The percentage change between this reference price and the predicted price is:
% = (p p_recent) / p_recent 脳 100
Conditio n
Trend Label
Scor e
Practical Meaning
% > 3%
Rising
30
Prices are appreciating; selling now or soon captures the upward movement
3%
% 3%
Stable
15
Price is in equilibrium; no strong directional signal either way
% <
3%
Falling
5
Market is contracting; deferring where possible may recover value
Table 4. Trend Component scoring based on percentage price change over the trailing seven-day window.
The 卤3% band around the stable zone is an important design choice. In day-to-day flower markets, price variation of up to
3% is common and does not reliably signal a directional trend
-
it is more accurately characterized as market noise. Classifying every sub-3% movement as a meaningful trend would produce unstable, frequently changing recommendations that erode grower trust and lead to decision paralysis. The band suppresses this noise and ensures that only genuine, sustained price movements trigger Rising or Falling classifications.
The score asymmetry 30 for Rising versus 5 for Falling
-
is intentional and reflects the agricultural market context. For a perishable commodity, a rising price is an active signal that rewards immediate action, while a falling price, though unfavorable, still leaves the grower with limited options if the product cannot be stored. The score differential encodes this asymmetry without eliminating the falling market signal entirely.
-
-
Risk Component (R)
While the Sustainability Component evaluates where the predicted price sits within the historical distribution, and the Trend Component evaluates short-term direction, the Risk Component evaluates the reliability of the market as a whole. Specifically, it asks: regardless of where today's price falls, how predictable and stable has this market historically been?
A market with a long history of tightly clustered prices around the mean is one in which predictions can be trusted and selling decisions can be made with reasonable confidence. A market with large swings in both directions is one in which even an accurate predicted price may be followed immediately by a sharp reversal. The grower's effective risk is higher in the latter case, even if today's predicted price happens to be favorable.
The Coefficient of Variation captures this distinction precisely. By expressing price dispersion as a proportion of the mean, it provides a normalized measure of relative volatility that is comparable across product combinations with different price scales:
CV Range
Risk Level
Score
Market Interpretation
CV < 0.05
Low
30
Highly stable market; price fluctuations are minor and predictions are reliable
0.05 CV
0.10
Medium
20
Moderate volatility; reasonable confidence in forecast with some caution warranted
CV > 0.10
High
10
Volatile market history; predicted price should be treated ith reduced confidence
Table 5. Risk Component scoring based on the Coefficient of Variation.
The CV thresholds of 0.05 and 0.10 represent meaningful boundaries in the context of retail flower markets. A CV below
0.05 implies that prices rarely deviate by more than 5% of the mean a market condition consistent with stable consumer demand and predictable supply. A CV above 0.10 implies that
price swings of more than 10% of the mean are common, indicating a market where external shocks seasonal demand surges, supply disruptions, competitor pricing are frequent enough to make any single-day price forecast inherently uncertain.
-
Composite SAMIS Score and Score Structure
The three components are summed directly to produce the SAMIS score:
SAMIS = S + T + R, 0 SAMIS 100
Component
Min Score
Max Score
Weigh t of
Total
Primary Measure
S
Sustainability
10
40
40%
z-score interval logic
T Trend
5
30
30%
7-day % price change
R Risk
10
30
30%
Coefficien t f
Variation
SAMIS Total
25
100
100%
Composit e index
Table 6. SAMIS component structure, score ranges, and relative contribution to the composite index.
The minimum achievable score of 25 arises from the floor values of each component (S=10, T=5, R=10) and is assigned to markets where the predicted price is a statistical outlier, the recent trend is falling, and historical volatility is high simultaneously. This floor is intentionally non-zero, reflecting the fact that even in severely unfavorable conditions, some residual market information remains available. The maximum of 100 represents a market where the predicted price is fully within the historical normal range, a rising trend is confirmed, and historical volatility is low conditions that together constitute the most reliable possible selling environment.
-
-
Four-Class Market Recommendation
The SAMIS score is not simply thresholded at a single cutoff. A four-class recommendation framework evaluates the composite score alongside the individual component classifications, ensuring that specific combinations of conditions receive appropriate handling rather than being averaged away.
Class 1 Optimal Selling Conditions
The highest recommendation tier requires all three components to reach their maximum scores. Sustainability must be 40 (predicted price within one standard deviation of historical mean), Trend must be 30 (price rising by more than 3%), and Risk must be 30 (CV below 0.05). This triple conjunction ensures that the Optimal classification is reserved for markets that are simultaneously normal, appreciating, and
stable not merely one or two of those things. The typical SAMIS range for Class 1 conditions is 85 to 100.
Class 2 Favorable Selling Conditions
Favorable conditions are identified when at least two components are at or near their maximum values and the third is not at its minimum. This captures markets where the overall picture is positive but one dimension typically either a flat trend or moderate volatility ntroduces some uncertainty. Growers in Class 2 markets are advised to proceed with selling, with awareness that the market is not fully optimal. SAMIS scores in the range of approximately 65 to 84 characterize this class.
Class 3 Moderate Conditions
The Moderate classification covers a broad middle range in which the market is functional but no strong directional case exists for or against immediate selling. This class also captures a specific edge case: when the Sustainability score is exactly 20 meaning the predicted price lies between the 1.5 and 2 boundaries the market receives a Class 3 classification regardless of trend and risk scores, because the marginal statistical plausibility of the predicted price introduces a structural level of uncertainty. SAMIS scores from 40 to 64 typically characterize this class.
Class 4 Unfavorable Conditions
The Unfavorable classification is triggered when the composite SAMIS score falls below 40, or when the Sustainability score reaches its minimum of 10 (predicted price beyond 2), or when both Trend and Risk are simultaneously at their minimum scores. The composite score threshold and the individual component triggers operate as alternative conditions, meaning any one of them is sufficient to generate a Class 4 recommendation. Where post-harvest storage is feasible, the system advises deferring the selling decision until market conditions improve.
Class
SAMI S
Range
S
Score
T
Score
R
Score
Recomme ndation
1 Optimal
85
100
40
30
30
Sell immediate ly
2
Favorable
65
84
30
15
20
Recomme nded to sell
3 Moderate
40
64
20
Any
Any
Monitor; partial sale possible
4
Unfavorable
< 40
10
5
10
Defer if storage allows
Table 7. Four-class market recommendation framework derived from SAMIS component scores.
-
-
RESULTS AND DISCUSSION
-
Price Prediction Performance
The prediction pipeline was evaluated on a held-out test set of Anthurium transaction records not used during training. Comparison was made against an ARIMA baseline, a standalone ensemble regressor trained on the same feature set without the sequential pre-processing layer, and the sequential model operating without ensemble fusion. The fusion architecture produced the most stable predictions across the evaluation period, with notably lower error variance during weeks characterized by demand volatility. The reduction in prediction variance rather than just mean error is of particular practical significance in this context, since growers are harmed not just by average forecast inaccuracy but by unpredictable forecast quality.
Model Configuration
MAE (LKR)
RMSE (LKR)
Error Std. Dev.
ARIMA Baseline
18.42
24.17
15.83
Ensemble Regressor Only
12.65
16.94
11.27
Sequential Model Only
10.38
14.22
9.61
SequentialEnsemble Fusion (Proposed)
6.73
9.48
6.12
Table 8. Predictive accuracy comparison across model configurations.
-
SAMIS Assessment Evaluation
The performance of the SAMIS classification scheme was assessed through analyzing whether SAMIS-classified periods matched up with the market behavior that happened after those times based on previously known information regarding the results. Times classified under Class 1 or Class 2 were compared to the weeks where the subsequent market values did not fall below the forecasted price, indicating that these periods represented a good time for selling. The correlation between SAMIS classification and the previously observed market conditions proved to be significantly greater than what would be expected based solely on the price forecast.
A particularly instructive set of observations involved market periods where the predicted price was high but the Sustainability Component received a score of 10, indicating that the predicted price lay beyond the two standard deviation boundary of historical distribution. In each such case, subsequent market prices returned toward the historical mean within several days, confirming that these episodes represented transient price spikes. The SAMIS framework correctly classified these as unfavorable or moderate
conditions despite the superficially attractive predicted prices
– a distinction that a price-only recommendation system would have missed.
The Risk Component also demonstrated discriminative value at the product-combination level. Varietyshop combinations with high historical CV received consistent Class 3 or Class 4 classifications even during periods of rising trend, reflecting genuine market unpredictability that was subsequently confirmed in price trajectory data. Low-CV combinations, by contrast, received stable Class 1 and Class 2 classifications during normal market periods, providing growers with reliable positive signals when the underlying market conditions genuinely warranted them.
-
Recommendation Engine Behavior
The manner in which the four-class model reacts in extreme situations is something worth looking into, especially because it is during the more ambiguous market situations when the added value of using the recommendation algorithm becomes evident over merely using a score cutoff. For instance, when there is a significant increase in the price level (T = 30), while at the same time there is a statistical outlier for the forecasted price level (S = 10) with high volatility (R = 10), the resultant SAMIS score of 50, if applied to a single-threshold system, will categorize the market situation as moderate. However, the component-based decision-making process will override the result by classifying it under the Class 4 category based solely on its Not Sustainable classification, thus providing valuable advice not to invest in such a market.
-
-
CONCLUSION
In this study, a market intelligence solution for predicting daily prices of Anthurium and assessing their sustainability was introduced, which was developed based on the realistic information-related challenges experienced by small floriculture growers in Sri Lanka. The prediction model utilizes a temporal sequence model in combination with an ensemble regression layer within a sequential fusion framework to identify both the time-dependent features and contextual aspects of the market, generating predictions of daily prices that are more robust against market fluctuations compared to each mode used alone.
The truly valuable component of this study is the introduction of SAMIS, a method for transforming the simple predicted price value into a detailed market analysis along three separate statistical axes. By analyzing the predicted price relative to past values for the exact combination of goods, relative to the current trend in the market, and relative to the volatility of those past prices in the market, SAMIS offers a much richer context for decision making when compared to a simple single-axis system. The classification algorithm applied to the outputs of SAMIS accounts for special cases where increasing prices exist in volatile markets and normal prices exist in steady markets in a way consistent with agricultural economics.
It is important to recognize the limitations of this study. The parameters that were used to define the thresholds of SAMIS
– the standard deviations to determine sustainability, the 卤3% window to determine the nature of the trends, and the CV levels as the indicators of risk were chosen based on considerations of both statistics and the market dynamics, rather than being empirically determined using a validation sample. A more accurate calibration of these parameters using empirical methods can be helpful to achieve better sensitivity of the system in case of abnormal distributions of prices in markets under investigation.
Future research will concentrate on the calibration of thresholds, integration of external signals from the market environment, and applying this methodology to new types of ornamental flowers and market settings. This paper has presented architecture for two different kinds of models, which are applicable not only to the flower business but also any other perishable agricultural commodity markets; the authors believe that this paper will be able to provide the basis for future works.
REFERENCES
-
ochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 17351780.
-
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 532.
-
Box, G. E. P., & Jenkins, G. M. (1976). Time Series Analysis: Forecasting and Control. Holden-Day, San Francisco.
-
Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Springer, New York.
-
iaw, A., & Wiener, M. (2002). Classification and regression by randomForest. R News, 2(3), 1822.
-
ers, F. A., Schmidhuber, J., & Cummins, F. (2000). Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10), 24512471.
-
C en, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785794.
-
uguma, H. T., & Abdi, D. (2019). Analysis of cut flower export market concentration and trend in Ethiopia. Journal of Agricultural Economics, Extension and Rural Development, 7(1), 781788.
-
endoza, M. G., & Rosegrant, M. W. (1995). Pricing conduct of spatially differentiated markets. International Agricultural Trade Research Consortium Working Paper.
-
Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5), 11891232.
-
Shumway, R. H., & Stoffer, D. S. (2017). Time Series Analysis and Its Applications (4th ed.). Springer.
-
Hyndman, R. J., & Koehler, A. B. (2006). Another look at measures of forecast accuracy. International Journal of Forecasting, 22(4), 679688.
-
Abadi, M., et al. (2016). TensorFlow: A system for large-scale machine learning. 12th USENIX Symposium on Operating Systems Design and Implementation, 265283.
-
McKinney, W. (2010). Data structures for statistical computing in Python. Proceedings of the 9th Python in Science Conference, 5156.
-
Kouadio, L., et al. (2018). Artificial intelligence approach for the prediction of Robusta coffee yield using soil fertility properties. Computers and Electronics in Agriculture, 155, 324330.
