Image Annotation for Large Dataset











up vote
0
down vote

favorite












Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.










share|improve this question
























  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago















up vote
0
down vote

favorite












Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.










share|improve this question
























  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago













up vote
0
down vote

favorite









up vote
0
down vote

favorite











Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.










share|improve this question















Screenshot



I have this huge data set for which I have only taken a sample from that data to show you,now as you can see it has two class cat and dog where in the training data i have to label it manually since the cat and dog images are mixed, so is there any alternative way to do it.I have to annotate this then only i can train as to if whether cat or dog.







annotations computer-vision transfer-learning






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 8 at 13:20









Dave

1,91051322




1,91051322










asked Nov 8 at 11:46









S L SREEJITH

12




12












  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago


















  • How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
    – Mozglubov
    Nov 15 at 21:35










  • Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
    – S L SREEJITH
    2 days ago










  • Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
    – Mozglubov
    2 days ago
















How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
– Mozglubov
Nov 15 at 21:35




How accurate must the labelling be? If you can have some error rate, then you could always label automatically with an available deep network; many deep networks are quite accurate for dog and cat images, since these are often well represented in common training datasets.
– Mozglubov
Nov 15 at 21:35












Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
– S L SREEJITH
2 days ago




Hi Mozglubov, in order to model as from the training set i have to split this as cat and dog first and for larger images in a directory i cannot manually annotate each one as cat.1,cat.2 or dog.1,dog.2,so is this there any other alternative for this or i have to manually do this?
– S L SREEJITH
2 days ago












Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
– Mozglubov
2 days ago




Well, what I was trying to get at is whether or not you need that exact annotation. Depending on what you plan to do with this dataset, it is sometimes sufficient to have labels which are mostly correct. This is sometimes referred to as "soft" or "noisy" labelling. This paper by Reed et al. (arxiv.org/abs/1412.6596) provides a good example of achieving strong results despite not having fully accurate training labels. If you absolutely do need a human-accurate set of annotations but cannot do it yourself, then you may want to use services like Amazon's Mechanical Turk.
– Mozglubov
2 days ago












1 Answer
1






active

oldest

votes

















up vote
0
down vote













One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






share|improve this answer





















    Your Answer






    StackExchange.ifUsing("editor", function () {
    StackExchange.using("externalEditor", function () {
    StackExchange.using("snippets", function () {
    StackExchange.snippets.init();
    });
    });
    }, "code-snippets");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "1"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














     

    draft saved


    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53207102%2fimage-annotation-for-large-dataset%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    0
    down vote













    One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






    share|improve this answer

























      up vote
      0
      down vote













      One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






      share|improve this answer























        up vote
        0
        down vote










        up vote
        0
        down vote









        One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.






        share|improve this answer












        One possible solution is to upload your dataset to labelbox (link: https://www.labelbox.com/) there you are able to annotate your dataset and then download the results for instance as a JSON file. The web page correlates your images with the labels and then you can use those informations for your work.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 8 at 16:58









        Niccolò Cacciotti

        1124




        1124






























             

            draft saved


            draft discarded



















































             


            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53207102%2fimage-annotation-for-large-dataset%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Guess what letter conforming each word

            Run scheduled task as local user group (not BUILTIN)

            Port of Spain