Course Name: Natural Language Processing in Trading, Section No: 3, Unit No: 3, Unit type: Notebook
Hi Team,
Good Day,
I have two problems here:
- I have two data sets: one (news_headline_sentiments_aapl.csv: apple stock sentiment data) is from Quantra(QuantInsti) and another (news_headline_sentiment_data_combined_2014_2015_2016_aapl.csv: apple stock sentiment data) which I prepared this data(taken from source:https://sites.google.com/view/headlinedataset/home) from scratch through data preprocessing. I have prepared the dataset for the combined 2014, 2015, 2016(But that doesn't matter). In my dataset, there's two-column ["start_time_stamp" and "end_time_stamp"], whereas in quantra's dataset there's only one column called ["time_stamp"]..so could you please confirm whether the ["time|_stamp"] column from Quantra dataset is same as ["start_time_stamp"] from my dataset?
- In my dataset, there's no "sentiment_class" and "sentiment_score" column. So I'm very curious to know the procedure to calculate "sentiment_class" and "sentiment_score" from scratch.
Apple Stock Data: 1 csv with sentiment class and score and 1 csv without sentiment class and score