Exploring investor-business-market interplay for business success prediction

Gangwani, Divya; Zhu, Xingquan; Furht, Borko

doi:10.1186/s40537-023-00723-6

Research
Open access
Published: 16 April 2023

Exploring investor-business-market interplay for business success prediction

Divya Gangwani¹,
Xingquan Zhu¹ &
Borko Furht¹

Journal of Big Data volume 10, Article number: 48 (2023) Cite this article

2866 Accesses
2 Citations
Metrics details

Abstract

The success of the business directly contributes towards the growth of the nation. Hence it is important to evaluate and predict whether the business will be successful or not. In this study, we use the company’s dataset which contains information from startups to Fortune 1000 companies to create a machine learning model for predicting business success. The main challenge of business success prediction is twofold: (1) Identifying variables for defining business success; (2) Feature selection and feature engineering based on Investor-Business-Market interrelation to provide a successful outcome of the predictive modeling. Many studies have been carried out using only the available features to predict business success, however, there is still a challenge to identify the most important features in different business angles and their interrelation with business success. Motivated by the above challenge, we propose a new approach by defining a new business target based on the definition of business success used in this study and develop additional features by carrying out statistical analysis on the training data which highlights the importance of investments, business, and market features in forecasting business success instead of using only the available features for modeling. Ensemble machine learning methods as well as existing supervised learning methods were applied to predict business success. The results demonstrated a significant improvement in the overall accuracy and AUC score using ensemble methods. By adding new features related to the Investor-Business-Market entity demonstrated good performance in predicting business success and proved how important it is to identify significant relationships between these features to cover different business angles when predicting business success.

Graphical Abstract

Introduction

The success of the business is the main reason for the investors, stakeholders and entrepreneurs to stay in the market and grow their business further. This keeps them motivated to come up with new ideas and innovations which is important for the economic growth of the nation. Hence, investors and stakeholders are in constant need of a method that can predict the performance of their business beforehand. It gives them the advantage to invest wisely and compete in the market with an expectation to achieve considerable returns on their investments [1]. Nowadays, many researchers focus on identifying practical tools and methodologies to determine business success factors. There has been a long history of research that tried to analyze the features or factors that make the business successful [2], however, the previous researches needs to be consistent with the literature and features selected in predicting business success. With the ever-changing economy and business dynamics, there is a need to identify the factors that effectively analyze the rise and fall of the business. We aim to bridge the gap in the literature by identifying the most accurate definition of business success such that it brings more clarity in selecting the most critical features from different business angles which are responsible for creating a successful business and developing additional features to demonstrate the importance of selecting suitable features for modeling and predicting business success. In recent years, several small and mid-size companies are gaining attention due to their capability to capture the market and merge with unicorns to achieve more publicity [3]. With millions of investments made by the investors and its rapid increase in achieving unicorn status, it has become even more challenging to predict whether the business will eventually succeed or fail. There are a lot of factors that can affect the performance of the business such as the sector in which the business operates, the number of employees working for the company, skills of the employee, location, size of the company, competition level, and so on. It is difficult to measure all factors and even more challenging to identify several factors which influence the company’s performance.

Recent studies have many limitations due to the use of only specific features to predict business success. For example, [4] utilized only financial features to determine the rise and fall of the business. This would be a trivial solution when other factors are not considered during the evaluation of companies’ performance. Another study highlighted the use of social media marketing to promote business success [5]. It has been observed that by utilizing social media features, business gains more attention and can directly promote their brands and products to customers. Combining social media features with deep learning algorithms improves market captivity for the firm, which ultimately results in success. Recently [6], proposed a framework for applying investment and business features in conjunction when evaluating key criteria for measuring business success. Business factors such as R &D employees, patents in the company, managerial employees, and company valuation which is an important indicator of business success were applied. When a company reaches a valuation of $1 billion it achieves the status of unicorn which distinguishes them from other companies. All these factors together contribute to the success of the business. Therefore it is important to find a correlation between these features in order to accurately predict business success.

Predicting business success is intuitively important and offers great significance to investors and stakeholders as they can effectively utilize the information to attain competitive advantage through timely analysis and accurate prediction.

Machine learning methods have been used in the past to build predictive models for business success and provide corresponding results and suggestions [6,7,8]. Supervised machine learning methods such as Random Forest, SVM, and Gradient boosting are mostly popularly applied for business prediction using news articles and factual features from company datasets which are publicly available on TechCrunch and Crunchbase websites [9]. In addition, many researchers also proposed neural networks in combination with classification methods to achieve high accuracy when dealing with high cardinality datasets [8]. Despite the growing amount of models built for business success prediction, most of them cannot be applied in practice due to the lack of knowledge about the interrelated features which is an essential requirement for success prediction. Moreover, many methods focus on specific features that define business success [5, 10] and do not take into consideration how other features/factors can play an important role in the decisions making and in turn can result in a biased decision. In addition, many studies [7, 11] gathered data from different sources which included company’s who are still in operating status and does not have enough information to determine their path toward success. Including such information may easily cause issues in trusting the applicability of the results.

In order to accurately predict business success and avoid any kind of bias, there should be a clear definition of success and identify major features and their interrelated sub-features which can be applied in practice to predict business success.

In this paper, we propose a new definition of business success using machine learning techniques to create a predictive model. In our definition, we include companies that have achieved initial public offering (IPO) or have undergone merger and acquisition (M &A) and classify them as successful, and companies that have been closed are classified as failed. Additionally, we use feature engineering techniques to create new features based on three main parities: Investment, Business and Market with a focus on stating the fact that these three entities play a major role in identifying critical factors for business success. Experiments and comparison with baseline demonstrated improved accuracy and AUC score using supervised machine learning algorithms. Ensemble methods, such as Random Forest and XGBoost achieved the best results when compared to other supervised learning methods.

Business success and investor-business-market interplay

Success in general refers to the achievements obtained either by getting some profit or by fulfilling small goals in life. When we consider business success, the overall definition remains the same, however, there are more factors that affect how we measure business success depending on different business angles.

There is a wide variety of research dedicated to analyzing business success [12,13,14]. Some use financial indicators are the major factor when predicting success, while others use companies’ demographic information, human resource details, or past financial records to measure a company’s performance. In a recent study carried out in the European market, [15] suggests that utilizing business features such as human resource, demographics, job skills, team size, management, etc. were crucial in distinguishing the success and failure of companies. This research was also extended for U.S market which highlighted the fact that business Human Resource features are capable of detecting success and failure in companies worldwide. Another study [9] focused on the financial indicators for predicting the success of the firms. This is due to the fact that early startups do not have much information to evaluate their path toward progress and in such cases, financial indicators are more reliable for detecting success or failure. Evidence suggests that when a company reaches a new height, for example, has a valuation of $1 billion known as the unicorn, such companies consider different factors for evaluating their growth and cannot be compared to the growth of start-ups or small businesses as they reach to a new stage in the life cycle of a business and experience an exponential rise in the success [16,17,18]. Hence features such as new innovations, patents, raised amounts by investors, funding amount, market sector, and investor demographics come into play when companies achieve the status of a unicorn. Many studies were conducted on market statistics to see how the market plays an important role in analyzing the growth of the business [19]. Small and medium firms focused on operating on either one industry sector or two with an aim of expanding their business and capturing enough customers so that they can establish their business into one sector firmly, whereas large firms [20]have the capability to capture the market in many industry sectors, such that, even if one sector fails they have more resources and funds to support and grow their businesses in other sectors or industries. For example, a case study done on the European market highlighted the fact that market orientation leads to corporate success in firms [21]. The ultimate goal of achieving success was to narrow down the market and focus on major business dimensions including product cost, customer satisfaction, product environment, technology, and innovations in the business such that the company achieves success in product creation. A successful product in turn leads to successful business due to its capability to capture the market and provide customer satisfaction.

Based on these studies, we conclude that there is not just one factor or a particular set of features that define business success. Several firms are at different stages of their growth and have different factors influencing their business decisions and considering major factors together can contribute in evaluating business success.

In this paper, we consider all types of firms such as small and mid-size firms, startups, unicorns and large companies, when defining business success using machine learning models. In order to measure success, the outcome of the business having a status of either an IPO or Merger and Acquisition (M &A) is considered as a variable of business success and the businesses that have the outcome of closed are considered as failure which becomes our target variable for binary classification problem. We also highlight that several factors or features mentioned in the previous researches are interrelated and cannot be considered separately as the dynamics of business keeps changing but the factors which determine success or failure remain certain. Hence with the above observation, we design an approach to divide these features into three main parities: Investor, Business and Market, and demonstrate how these features together contribute towards evaluating business success irrespective of the type and the size of the business.

Business success

The success of the business is defined by the status of the company given in the dataset used for experimentation. The company status is divided into four categories: (1) Operating; (2) IPO; (3) Acquired; and (4) Closed as n in Fig. 1. A company gets the status of operating during its early stages of development or if they are just a survival company and there is not much information available to determine whether these companies will eventually fail or succeed. IPO and Acquired are clear statuses to determine whether these companies have been successful and have received enough funding or have a valuation of a huge amount. When a company goes public they receive the status of IPO which means that they release its portion of funds in the public market with an aim to achieve a huge price gain. Merger and Acquisition (M &A) occurs when a company of the same level gets acquired or merged with another company of a similar level such as Google, Amazon etc. Therefore when a company achieves a status of either IPO or acquired, it is a clear distinction that these companies have enough funding to grow in the market and achieve success whereas the closed status is given to the companies that are no longer operating or have failed to survive. Depending on the company dynamics and having a clear objective of predicting the success of the business, it is important to classify the companies and label them into two categories: success or failure. Based on the dataset used for experiments and keeping the important and relevant feature intact, we selected companies that achieved a status of IPO or Acquisition and labeled them as positive class and the companies that had the status of closed were labeled as negative class 1. The companies that were in the status of operating were excluded from the training set due to the lack of information available to determine whether the companies would be successful or not 1. Keeping such information in the training process includes some kind of bias which may not produce relevant results for comparison. Hence a significant portion of companies were removed from the training set in order to accurately classify and predict business success.

Investor-business-market triangle

In business prediction, the success of the firms depends on three main entities: Investor, Business, and Market which forms an interrelated triangle as shown in Fig. 2 since these three entities together contribute towards the success of the business. The relationship between these three entities has been supported by wide range of publications [22,23,24,25] which shows that when determining whether a business will succeed or fail, these three entities should be taken into account and ignoring any one of of these aspects could lead to an unsatisfactory outcome.

According to the supported literature, investor and the market have a close relationship in contributing towards the economy of the company. In these studies [22, 24], the use of technology in promoting market is directly related to the performance of the business. Information Technology (IT) has changed the ways of how business used to operate. It brings in more employment and investments into the company. Using technology in different market sectors encourages investors to bring more investments into the business which in turn attracts more customers. There are several strategic factors which influence the performance of the company. The four deterministic factors includes: Business demographics, product innovations, market strategy and market trends together analyze the shift in the performance of the company. In the telecommunication industry in Africa [26], statistical analysis was carried out to evaluate the main factors responsible for business performance. In order to evaluate critical factors, customer reviews were taken into account and a business design was prepared to carry out the pros on cons of several factors affecting the industry sector of the company. Focusing on factors such as strategic design, innovation and product creation highlighted the trends in the market and demonstrated how business and market relation led the company to reconsider their failure points and make changes to incorporate different business angles which showed successful innovations in the industry.

A study based on the performance of UK based companies gathered evidence demonstrating how market orientation is directly associated with company’s performance [27]. Factors such as estimated product cost, demographics of company, financial investments etc. contributes towards the performance of the company.

Usually, when a business succeeds it is not because of just one factor, but several factors contribute in combination to the success of the business. As a result, the IBM triangle acts as a shield for the key elements involved in achieving company’s success.

Once we analyze the IBM interplay and define the most important features related to these entities in our dataset, the machine learning model provides enhanced results when predicting business success.

Investor features

Investor features include three main aspects of the business, Investor demographics, Investor sector and Investor financial information. These three main features helps to answer question such as which business sector has more growth? How many investors invested in the market sector? What is the amount of investments made by the investor? Having answers to such questions provides entrepreneurs with more information about whether the business will get repeatable returns on their investments to better assess their risk of investments into the business. Having investor information related to the business and market sector helps in reducing the risk of uncertainty that comes with every investment made into the business.

The investor demographic features include information of the investor’s location, city, country, and other personal information about the investor. Top investors receive enough recognition over the years such that they always have an edge towards forward-thinking about the growth of the firms. Figure 4 highlights the top 10 investors in the dataset based on the amount of funding raised by the investors. A recent study analyzed how investors’ demographics directly affect the performance of financial sector industry. A stock market industry conducted an annual evaluation to come up with factors affecting their stock price investments [28]. The research showed how demographic factors including age, locality, education level etc., strongly influenced an investor’s decision to buy or sell stocks.

Investors usually keep a track of recent market trends and customer behavior before making any decision to invest in the company. Hence, evaluating the recent trends in the market sector provides more confidence to the investors to invest into the business. As shown in Fig. 4, we provided statistical analysis on the dataset to include investor features which highlights the top market sectors that received majority of the funding. Semiconductor, Biotechnology and Software are one of the top 3 sectors that received majority of the funding by investors. Investing more money into different markets sector increases the price of those products and this provides an upward momentum for the company to keep growing over the years and maintain their success in the market. Another major aspect of the Investment feature is the financial information about the investor. The financial information includes the capability of an investor to raise more amount in the market and increase the rounds of funding such that the business and the investor achieve maximum profit. Other features such as the funding amount raised by the investor in each sector, type of funding received by the company, returns on investment etc., provides enough evidence to predict the company’s likelihood of success. As shown in Fig. 3, the companies that receives the type of funding such as the Venture Capitalist (VC) funds have higher chances of being public and have faster growth rate as compared to companies backed by either seed or other types of funds [29]. Angel funding is another common type of funds that provides more chances for the company to survive the market risks and growth eventually in terms of more employment, sales and financing [30]. Early startups are in need of such funding and would benefit from a VC fund or any other funding as young firms are more driven by latest technology, ideas and innovations. Investors constantly look for such new innovations and are ready to provide initial funding in exchange for a percentage of profit with these firms [31].

Business features

Business Features refer to the company’s personal information such as product details, human resources, business demographics, innovations, financial information, and so on. These features are further subdivided into a number of sub-features containing information about the companies. Many studies highlighted the importance of business features when predicting business success. Early-stage startups or small businesses have very little information about major investors or market characteristics due to the limited financial availability to explore larger areas of growth into the business [11]. New ventures or startups rely on the entrepreneur’s techniques and vision to make the business successful. The innovations, new technology, and creativity of an entrepreneur are leading factors for a firm’s success since new innovations or technological changes attract customers and generate profits [32,33,34]. Another study highlighted that mandating corporate social responsibility on investments made the firm an important aspect in increasing the economy of the nation. A study was done on India’s emerging market which stated that the governing body has a social responsibility to learn and adapt certain policies to create a sustainable environment for the businesses such that firms gain an advantage and achieve profitability over the years [35]. Hence when evaluating the business success of startups or small firms, business features such as business demographics and founders’ vision as well as support from the government plays an important role in giving enough information to make a successful prediction. On the other hand, for large firms, there is a need for more information such as financial features and market trends including business features to evaluate and predict business success. Regardless of the type of firms, business features serves as a common point or a major requirement when predicting business success. A study using business demographic features and human resource features such as company age, team size, number of staff, education level etc., on an established company dataset in US and Europe showed how human resource features were common predictors of success or failure of the firms [15]. This evidence led to the belief that human resource factors needs to be considered as an important resource when predicting business success. Another study on large corporate firms examined several financial indicators for predicting business success [36]. Large firms are more capable of generating profits and hence factors such as returns on investment, capital shares, amount of funding received etc., are key indicators for predicting business success.

Motivated by the previous studies, we identify all the important business features available in the dataset and provide a statistical analysis which gives useful insights of the variables when developing a predictive model. For example, the analysis done on business sectors in the company demonstrates that most of the funding goes to the top business sectors that are in high demand in the market as shown in Fig. 5. Another important factor for growth of the business is the demographics of the company that have been highlighted in Fig. 6, which highlights that U.S has highest number of companies that are either startups or operating and within the U.S, California has the majority of headquarters locations. The sector in which the business operates is one of the important features when predicting the performance of the company as business sectors provide a sustainable environment for the businesses to flourish [37]. With the ever-changing market, it is essential for the companies to keep a track of those changes and shift their investments strategies or switch funds based on the predictive methods used for analyzing market trends.

Market features

Market features are one of the main aspects of the business that decides the rise and fall of a company. Factors such as market pricing, competitive strength, market size, market digitization, demand, and demographics determine the trends in the market and attract customers to generate market advantage for their business.

Recently, external factors have contributed more towards the shift in the market which has led the entrepreneurs to change their strategies and decisions and share their vision with other stakeholders to keep up with the ups and downs in the business [38]. Market shifts due to external factors such as Covid-19 outbreak, inflation etc, have led to job layoffs, disruption in the commercial sectors, and shut down many high-growing sectors worldwide. For example, the tourism industry have seen a sharp decline during Covid-19 pandemic whereas the pharmaceutical industries have seen a rise in their profit ratios [39].

Market trends constantly evolve with the changing time to fulfill customers’ needs which gives an edge to the companies in the competitive environment. For example market digitization brings new economic development by spreading the market internationally to capture a larger customer base [40]. Hence entrepreneurs are looking for constant innovations and the latest technologies being used for product development to build a successful brand and maintain a long-term relationship with the customers. Figure 7 shows the market trends of top industrial sectors based on the revenue generated in the 2022 quarter results. As shown in the figure, Technology has generated the maximum revenue followed by Retail, Finance sector, and so on [41]. In order to evaluate market trends, investors collect information about the revenue generated in these sectors, use of latest technology, measure innovation as well as evaluate the stock market trends over the years. These key features provide the investor with all the information needed to make an effective decision about investments in the business. Figure 8 highlights the percentage of market capitalization over the years for different market sectors showing the trends of market shift from 1900 to 2018 [42]. For example, the Transportation sector was significantly higher than other sections during the 1900 s but then it experienced a major decline in the 2000 s. Similarly, Information Technology had a new boom in the late 1900s and has since continued to expand. The plot shows that it is important to keep up with the market trends and other important market characteristics which make or break the business industry.

Proposed framework

In this section, we describe the proposed framework for business success prediction including the features used for learning and the prediction framework for modeling.

Features for learning

The features for machine learning are the main step in the proposed framework for analyzing business success. The dataset from the company and the investor file contains detailed information about companies, Their demographic information, funding information, market sectors, and investor details. These files are exported and merged with a unique identifier known as permalink to extract relevant features and make them ready for modeling. Three types of features are extracted from the dataset including the investor, business, and market features which describes the correlation between IBM entities in our prediction model.