Jia Yang
正在建设中...
BIO
Who I Am
A curious data analyst with finance background interested in translating complex datasets into insightful stories. Skilled in R, SQL, Machine Learning, Python, Advanced Excel, Data Analysis, Data Visualization, Database Design. I am a motivated and goal-oriented individual, who genuinely cares about what I do. Current M.S. student at Columbia University.
Columbia University MS
Iowa State University BS
MY PROJECTS
April 2019
By designing a clear database for automatically storing and easily retrieving the data, our work will help the company keep track of basic transactions, provide information on borrowers and investors, and generate insights through various analytical procedures that will help the company run the business more efficiently, help managers and employees make better decisions, mitigate investment risks, and at last improve the whole decision-making process.
The goal of this project is to examine whether President Trump’s tweets is a source of influence for the stocks market in the United States. The effect of tweets can be measured by coinciding changes in the stock market or changes in the share prices of certain companies targeted in the President’s Tweets. Of the roughly 34,000 tweets used in this research between include were examined using keywords sentiment analysis to find which the overall tweet sentiment. Once determined, we selected the tweets representing the strongest positive and negative emphasis to use as our designed intervention point. We will also use the Causal Impact Timeseries Algorithm developed by Google to validate the pre-conceived relationship between Trump’s twitter attacks/praises directed at a company and its resulting market performance in the stocks.
PREDICTING AIRBNB PRICE IN NEW YORK CITY (RANK TOP 2%)
December 2018
Pricing a rental property on Airbnb is a challenging task for the owner. On the other hand, customers have to evaluate an offered price with minimal knowledge of an optimal value for the property. This project aims to develop a price prediction model using a range of methods from linear regression to random forest, , XGBoost and support vector machine (SVM) to tackle this challenge.
DATABASE DESIGN AND ANALYTICS FOR PROSPER
THE IMPACT OF SOCIAL MEDIA (DONALD TRUMP’S TWEETS) ON FINANCIAL MARKET
March 2019
STATE FARM DISTRACTED DRIVER DETECTION
May 2019
BLOG
THE SCIENCE OF ORGANIZATIONAL CHANGE
2019
There is a gap between where most organizations are today and where they will need to be to succeed in the coming decade. The companies that win in the 2020s will be designed to constantly learn and adapt to changing realities, combine artificial and human intelligence in new ways, and harness the benefits of broader business ecosystems. Reaching this necessary future state will require a fundamental transformation.