How to read a tweet that contains a curly apostrophe (’)





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}







0















I am reading tweets in the following format:



545253503963516928|Wed Dec 17 16:25:40 +0000 2014|Massachusetts Pharmacy Owners Arrested in Meningitis Deaths http://xxxxxxxxx
545235402156937217|Wed Dec 17 15:13:44 +0000 2014|For First Time, Treatment Helps Patients With Worst Kind of Stroke, Study Says http://xxxxxxxxx


The code I'm using is



msn  <- read.table(file=".../msnhealthnews.txt", 
sep="|",
header = FALSE,
quote="",
fill=TRUE,
stringsAsFactors = FALSE,
numerals ="no.loss",
encoding = "UTF-8")


There is a tweet that has a curly apostrophe:



You’re Already Losing Your Mind:  http://on-msn.com/w0LiSx


This tweet is being read as follows:



"Youu0092re Already Losing Your Mind:  http://on-msn.com/w0LiSx"


How I can ensure that the tweet is read correctly? I thought that setting encoding = "UTF-8" would take care of this.










share|improve this question























  • Please define "correctly". You seem to want magical things to happen. Magic, sadly, isn't real.

    – hrbrmstr
    Nov 22 '18 at 2:50













  • Please use encoding = utf-8

    – Hunaidkhan
    Nov 22 '18 at 4:47











  • try running through the other encodings? Once I found that UTF-10 worked for me

    – RAB
    Nov 22 '18 at 7:26











  • @hrbrmstr: by correctly, I mean that You're should be read as such and not as Youu0092re I want to be able determine the total number of characters in each tweet using nchar(msn$V3). However, this throws up an error when it encounters Youu0092re

    – Anonymouse
    Nov 22 '18 at 18:37




















0















I am reading tweets in the following format:



545253503963516928|Wed Dec 17 16:25:40 +0000 2014|Massachusetts Pharmacy Owners Arrested in Meningitis Deaths http://xxxxxxxxx
545235402156937217|Wed Dec 17 15:13:44 +0000 2014|For First Time, Treatment Helps Patients With Worst Kind of Stroke, Study Says http://xxxxxxxxx


The code I'm using is



msn  <- read.table(file=".../msnhealthnews.txt", 
sep="|",
header = FALSE,
quote="",
fill=TRUE,
stringsAsFactors = FALSE,
numerals ="no.loss",
encoding = "UTF-8")


There is a tweet that has a curly apostrophe:



You’re Already Losing Your Mind:  http://on-msn.com/w0LiSx


This tweet is being read as follows:



"Youu0092re Already Losing Your Mind:  http://on-msn.com/w0LiSx"


How I can ensure that the tweet is read correctly? I thought that setting encoding = "UTF-8" would take care of this.










share|improve this question























  • Please define "correctly". You seem to want magical things to happen. Magic, sadly, isn't real.

    – hrbrmstr
    Nov 22 '18 at 2:50













  • Please use encoding = utf-8

    – Hunaidkhan
    Nov 22 '18 at 4:47











  • try running through the other encodings? Once I found that UTF-10 worked for me

    – RAB
    Nov 22 '18 at 7:26











  • @hrbrmstr: by correctly, I mean that You're should be read as such and not as Youu0092re I want to be able determine the total number of characters in each tweet using nchar(msn$V3). However, this throws up an error when it encounters Youu0092re

    – Anonymouse
    Nov 22 '18 at 18:37
















0












0








0








I am reading tweets in the following format:



545253503963516928|Wed Dec 17 16:25:40 +0000 2014|Massachusetts Pharmacy Owners Arrested in Meningitis Deaths http://xxxxxxxxx
545235402156937217|Wed Dec 17 15:13:44 +0000 2014|For First Time, Treatment Helps Patients With Worst Kind of Stroke, Study Says http://xxxxxxxxx


The code I'm using is



msn  <- read.table(file=".../msnhealthnews.txt", 
sep="|",
header = FALSE,
quote="",
fill=TRUE,
stringsAsFactors = FALSE,
numerals ="no.loss",
encoding = "UTF-8")


There is a tweet that has a curly apostrophe:



You’re Already Losing Your Mind:  http://on-msn.com/w0LiSx


This tweet is being read as follows:



"Youu0092re Already Losing Your Mind:  http://on-msn.com/w0LiSx"


How I can ensure that the tweet is read correctly? I thought that setting encoding = "UTF-8" would take care of this.










share|improve this question














I am reading tweets in the following format:



545253503963516928|Wed Dec 17 16:25:40 +0000 2014|Massachusetts Pharmacy Owners Arrested in Meningitis Deaths http://xxxxxxxxx
545235402156937217|Wed Dec 17 15:13:44 +0000 2014|For First Time, Treatment Helps Patients With Worst Kind of Stroke, Study Says http://xxxxxxxxx


The code I'm using is



msn  <- read.table(file=".../msnhealthnews.txt", 
sep="|",
header = FALSE,
quote="",
fill=TRUE,
stringsAsFactors = FALSE,
numerals ="no.loss",
encoding = "UTF-8")


There is a tweet that has a curly apostrophe:



You’re Already Losing Your Mind:  http://on-msn.com/w0LiSx


This tweet is being read as follows:



"Youu0092re Already Losing Your Mind:  http://on-msn.com/w0LiSx"


How I can ensure that the tweet is read correctly? I thought that setting encoding = "UTF-8" would take care of this.







r






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 22 '18 at 1:39









AnonymouseAnonymouse

528




528













  • Please define "correctly". You seem to want magical things to happen. Magic, sadly, isn't real.

    – hrbrmstr
    Nov 22 '18 at 2:50













  • Please use encoding = utf-8

    – Hunaidkhan
    Nov 22 '18 at 4:47











  • try running through the other encodings? Once I found that UTF-10 worked for me

    – RAB
    Nov 22 '18 at 7:26











  • @hrbrmstr: by correctly, I mean that You're should be read as such and not as Youu0092re I want to be able determine the total number of characters in each tweet using nchar(msn$V3). However, this throws up an error when it encounters Youu0092re

    – Anonymouse
    Nov 22 '18 at 18:37





















  • Please define "correctly". You seem to want magical things to happen. Magic, sadly, isn't real.

    – hrbrmstr
    Nov 22 '18 at 2:50













  • Please use encoding = utf-8

    – Hunaidkhan
    Nov 22 '18 at 4:47











  • try running through the other encodings? Once I found that UTF-10 worked for me

    – RAB
    Nov 22 '18 at 7:26











  • @hrbrmstr: by correctly, I mean that You're should be read as such and not as Youu0092re I want to be able determine the total number of characters in each tweet using nchar(msn$V3). However, this throws up an error when it encounters Youu0092re

    – Anonymouse
    Nov 22 '18 at 18:37



















Please define "correctly". You seem to want magical things to happen. Magic, sadly, isn't real.

– hrbrmstr
Nov 22 '18 at 2:50







Please define "correctly". You seem to want magical things to happen. Magic, sadly, isn't real.

– hrbrmstr
Nov 22 '18 at 2:50















Please use encoding = utf-8

– Hunaidkhan
Nov 22 '18 at 4:47





Please use encoding = utf-8

– Hunaidkhan
Nov 22 '18 at 4:47













try running through the other encodings? Once I found that UTF-10 worked for me

– RAB
Nov 22 '18 at 7:26





try running through the other encodings? Once I found that UTF-10 worked for me

– RAB
Nov 22 '18 at 7:26













@hrbrmstr: by correctly, I mean that You're should be read as such and not as Youu0092re I want to be able determine the total number of characters in each tweet using nchar(msn$V3). However, this throws up an error when it encounters Youu0092re

– Anonymouse
Nov 22 '18 at 18:37







@hrbrmstr: by correctly, I mean that You're should be read as such and not as Youu0092re I want to be able determine the total number of characters in each tweet using nchar(msn$V3). However, this throws up an error when it encounters Youu0092re

– Anonymouse
Nov 22 '18 at 18:37














0






active

oldest

votes












Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53422763%2fhow-to-read-a-tweet-that-contains-a-curly-apostrophe%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes
















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53422763%2fhow-to-read-a-tweet-that-contains-a-curly-apostrophe%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Guess what letter conforming each word

Run scheduled task as local user group (not BUILTIN)

Port of Spain