Bohdan Pavlyshenko

SI
h-index10
7papers
24citations
Novelty16%
AI Score19

7 Papers

LGNov 21, 2024
Predictive Analytics of Air Alerts in the Russian-Ukrainian War

Demian Pavlyshenko, Bohdan Pavlyshenko

The paper considers exploratory data analysis and approaches in predictive analytics for air alerts during the Russian-Ukrainian war which broke out on Feb 24, 2022. The results illustrate that alerts in regions correlate with one another and have geospatial patterns which make it feasible to build a predictive model which predicts alerts that are expected to take place in a certain region within a specified time period. The obtained results show that the alert status in a particular region is highly dependable on the features of its adjacent regions. Seasonality features like hours, days of a week and months are also crucial in predicting the target variable. Some regions highly rely on the time feature which equals to a number of days from the initial date of the dataset. From this, we can deduce that the air alert pattern changes throughout the time.

SIOct 13, 2013
Can Twitter Predict Royal Baby's Name ?

Bohdan Pavlyshenko

In this paper, we analyze the existence of possible correlation between public opinion of twitter users and the decision-making of persons who are influential in the society. We carry out this analysis on the example of the discussion of probable name of the British crown baby, born in July, 2013. In our study, we use the methods of quantitative processing of natural language, the theory of frequent sets, the algorithms of visual displaying of users' communities. We also analyzed the time dynamics of keyword frequencies. The analysis showed that the main predictable name was dominating in the spectrum of names before the official announcement. Using the theories of frequent sets, we showed that the full name consisting of three component names was the part of top 5 by the value of support. It was revealed that the structure of dynamically formed users' communities participating in the discussion is determined by only a few leaders who influence significantly the viewpoints of other users.

SIOct 13, 2013
Forecasting of Events by Tweet Data Mining

Bohdan Pavlyshenko

This paper describes the analysis of quantitative characteristics of frequent sets and association rules in the posts of Twitter microblogs related to different event discussions. For the analysis, we used a theory of frequent sets, association rules and a theory of formal concept analysis. We revealed the frequent sets and association rules which characterize the semantic relations between the concepts of analyzed subjects. The support of some frequent sets reaches its global maximum before the expected event but with some time delay. Such frequent sets may be considered as predictive markers that characterize the significance of expected events for blogosphere users. We showed that the time dynamics of confidence in some revealed association rules can also have predictive characteristics. Exceeding a certain threshold may be a signal for corresponding reaction in the society within the time interval between the maximum and the probable coming of an event. In this paper, we considered two types of events: the Olympic tennis tournament final in London, 2012 and the prediction of Eurovision 2013 winner.

IRMay 30, 2013
Tweets Miner for Stock Market Analysis

Bohdan Pavlyshenko

In this paper, we present a software package for the data mining of Twitter microblogs for the purpose of using them for the stock market analysis. The package is written in R langauge using apropriate R packages. The model of tweets has been considered. We have also compared stock market charts with frequent sets of keywords in Twitter microblogs messages.

SIFeb 8, 2013
Data Mining of the Concept "End of the World" in Twitter Microblogs

Bohdan Pavlyshenko

This paper describes the analysis of quantitative characteristics of frequent sets and association rules in the posts of Twitter microblogs, related to the discussion of "end of the world", which was allegedly predicted on December 21, 2012 due to the Mayan calendar. Discovered frequent sets and association rules characterize semantic relations between the concepts of analyzed subjects.The support for some fequent sets reaches the global maximum before the expected event with some time delay. Such frequent sets may be considered as predictive markers that characterize the significance of expected events for blogosphere users. It was shown that time dynamics of confidence of some revealed association rules can also have predictive characteristics. Exceeding a certain threshold, it may be a signal for the corresponding reaction in the society during the time interval between the maximum and probable coming of an event.

CLOct 30, 2012
The Model of Semantic Concepts Lattice For Data Mining Of Microblogs

Bohdan Pavlyshenko

The model of semantic concept lattice for data mining of microblogs has been proposed in this work. It is shown that the use of this model is effective for the semantic relations analysis and for the detection of associative rules of key words.

CLOct 22, 2012
Classification Analysis Of Authorship Fiction Texts in The Space Of Semantic Fields

Bohdan Pavlyshenko

The use of naive Bayesian classifier (NB) and the classifier by the k nearest neighbors (kNN) in classification semantic analysis of authors' texts of English fiction has been analysed. The authors' works are considered in the vector space the basis of which is formed by the frequency characteristics of semantic fields of nouns and verbs. Highly precise classification of authors' texts in the vector space of semantic fields indicates about the presence of particular spheres of author's idiolect in this space which characterizes the individual author's style.