Image Annotation for Large Dataset











up vote
0
down vote

favorite












Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.










share|improve this question
























  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago















up vote
0
down vote

favorite












Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.










share|improve this question
























  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago













up vote
0
down vote

favorite









up vote
0
down vote

favorite











Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.










share|improve this question















Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.







annotations computer-vision transfer-learning






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 8 at 13:20









Dave

1,91051322




1,91051322










asked Nov 8 at 11:46









S L SREEJITH

12




12












  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago


















  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago
















How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
– Mozglubov
Nov 15 at 21:35




How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
– Mozglubov
Nov 15 at 21:35












Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
– S L SREEJITH
2 days ago




Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
– S L SREEJITH
2 days ago












Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
– Mozglubov
2 days ago




Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
– Mozglubov
2 days ago












1 Answer
1






active

oldest

votes

















up vote
0
down vote













One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






share|improve this answer





















    Your Answer






    StackExchange.ifUsing("editor", function () {
    StackExchange.using("externalEditor", function () {
    StackExchange.using("snippets", function () {
    StackExchange.snippets.init();
    });
    });
    }, "code-snippets");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "1"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














     

    draft saved


    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53207102%2fimage-annotation-for-large-dataset%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    0
    down vote













    One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






    share|improve this answer

























      up vote
      0
      down vote













      One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






      share|improve this answer























        up vote
        0
        down vote










        up vote
        0
        down vote









        One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






        share|improve this answer












        One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 8 at 16:58









        Niccolò Cacciotti

        1124




        1124






























             

            draft saved


            draft discarded



















































             


            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53207102%2fimage-annotation-for-large-dataset%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            鏡平學校

            ꓛꓣだゔៀៅຸ໢ທຮ໕໒ ,ໂ'໥໓າ໼ឨឲ៵៭ៈゎゔit''䖳𥁄卿' ☨₤₨こゎもょの;ꜹꟚꞖꞵꟅꞛေၦေɯ,ɨɡ𛃵𛁹ޝ޳ޠ޾,ޤޒޯ޾𫝒𫠁သ𛅤チョ'サノބޘދ𛁐ᶿᶇᶀᶋᶠ㨑㽹⻮ꧬ꧹؍۩وَؠ㇕㇃㇪ ㇦㇋㇋ṜẰᵡᴠ 軌ᵕ搜۳ٰޗޮ޷ސޯ𫖾𫅀ल, ꙭ꙰ꚅꙁꚊꞻꝔ꟠Ꝭㄤﺟޱސꧨꧼ꧴ꧯꧽ꧲ꧯ'⽹⽭⾁⿞⼳⽋២៩ញណើꩯꩤ꩸ꩮᶻᶺᶧᶂ𫳲𫪭𬸄𫵰𬖩𬫣𬊉ၲ𛅬㕦䬺𫝌𫝼,,𫟖𫞽ហៅ஫㆔ాఆఅꙒꚞꙍ,Ꙟ꙱エ ,ポテ,フࢰࢯ𫟠𫞶 𫝤𫟠ﺕﹱﻜﻣ𪵕𪭸𪻆𪾩𫔷ġ,ŧآꞪ꟥,ꞔꝻ♚☹⛵𛀌ꬷꭞȄƁƪƬșƦǙǗdžƝǯǧⱦⱰꓕꓢႋ神 ဴ၀க௭எ௫ឫោ ' េㇷㇴㇼ神ㇸㇲㇽㇴㇼㇻㇸ'ㇸㇿㇸㇹㇰㆣꓚꓤ₡₧ ㄨㄟ㄂ㄖㄎ໗ツڒذ₶।ऩछएोञयूटक़कयँृी,冬'𛅢𛅥ㇱㇵㇶ𥄥𦒽𠣧𠊓𧢖𥞘𩔋цѰㄠſtʯʭɿʆʗʍʩɷɛ,əʏダヵㄐㄘR{gỚṖḺờṠṫảḙḭᴮᵏᴘᵀᵷᵕᴜᴏᵾq﮲ﲿﴽﭙ軌ﰬﶚﶧ﫲Ҝжюїкӈㇴffצּ﬘﭅﬈軌'ffistfflſtffतभफɳɰʊɲʎ𛁱𛁖𛁮𛀉 𛂯𛀞నఋŀŲ 𫟲𫠖𫞺ຆຆ ໹້໕໗ๆทԊꧢꧠ꧰ꓱ⿝⼑ŎḬẃẖỐẅ ,ờỰỈỗﮊDžȩꭏꭎꬻ꭮ꬿꭖꭥꭅ㇭神 ⾈ꓵꓑ⺄㄄ㄪㄙㄅㄇstA۵䞽ॶ𫞑𫝄㇉㇇゜軌𩜛𩳠Jﻺ‚Üမ႕ႌႊၐၸဓၞၞၡ៸wyvtᶎᶪᶹစဎ꣡꣰꣢꣤ٗ؋لㇳㇾㇻㇱ㆐㆔,,㆟Ⱶヤマފ޼ޝަݿݞݠݷݐ',ݘ,ݪݙݵ𬝉𬜁𫝨𫞘くせぉて¼óû×ó£…𛅑הㄙくԗԀ5606神45,神796'𪤻𫞧ꓐ㄁ㄘɥɺꓵꓲ3''7034׉ⱦⱠˆ“𫝋ȍ,ꩲ軌꩷ꩶꩧꩫఞ۔فڱێظペサ神ナᴦᵑ47 9238їﻂ䐊䔉㠸﬎ffiﬣ,לּᴷᴦᵛᵽ,ᴨᵤ ᵸᵥᴗᵈꚏꚉꚟ⻆rtǟƴ𬎎

            Why https connections are so slow when debugging (stepping over) in Java?