extract text every line and check in regular expression












-2















Hi I have the following 2 documents amongst many in the form of a list.



Document 1:



 contact
TomGonsalves
985664774
6560,m.d avenue.

harrydsouza
7676764
543,gt,road.


Document 2 in the same python list:



  contact
richard peterson
4563565
87,td street,Ny.


Here both the above text documents(1 and 2) are part of the same. In the text, I need to extract the 1st line after contact,i.e the name. However in Document-1 there are 2 names which I need to extract and just the names.



I need to write a regular expression to fetch the needful and the output should look like



     Names from Document 1 =[tomgonsalves,harrydsouza]
names from document-2=[richard peterson]


Can someone please help me with the reGex script in python 3 for this.There are many docs in the same list which may have different number of names in its contact. I just need to extract the names alone(be it 1 or multiple)










share|improve this question


















  • 1





    Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.

    – Tim Biegeleisen
    Nov 19 '18 at 15:51
















-2















Hi I have the following 2 documents amongst many in the form of a list.



Document 1:



 contact
TomGonsalves
985664774
6560,m.d avenue.

harrydsouza
7676764
543,gt,road.


Document 2 in the same python list:



  contact
richard peterson
4563565
87,td street,Ny.


Here both the above text documents(1 and 2) are part of the same. In the text, I need to extract the 1st line after contact,i.e the name. However in Document-1 there are 2 names which I need to extract and just the names.



I need to write a regular expression to fetch the needful and the output should look like



     Names from Document 1 =[tomgonsalves,harrydsouza]
names from document-2=[richard peterson]


Can someone please help me with the reGex script in python 3 for this.There are many docs in the same list which may have different number of names in its contact. I just need to extract the names alone(be it 1 or multiple)










share|improve this question


















  • 1





    Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.

    – Tim Biegeleisen
    Nov 19 '18 at 15:51














-2












-2








-2








Hi I have the following 2 documents amongst many in the form of a list.



Document 1:



 contact
TomGonsalves
985664774
6560,m.d avenue.

harrydsouza
7676764
543,gt,road.


Document 2 in the same python list:



  contact
richard peterson
4563565
87,td street,Ny.


Here both the above text documents(1 and 2) are part of the same. In the text, I need to extract the 1st line after contact,i.e the name. However in Document-1 there are 2 names which I need to extract and just the names.



I need to write a regular expression to fetch the needful and the output should look like



     Names from Document 1 =[tomgonsalves,harrydsouza]
names from document-2=[richard peterson]


Can someone please help me with the reGex script in python 3 for this.There are many docs in the same list which may have different number of names in its contact. I just need to extract the names alone(be it 1 or multiple)










share|improve this question














Hi I have the following 2 documents amongst many in the form of a list.



Document 1:



 contact
TomGonsalves
985664774
6560,m.d avenue.

harrydsouza
7676764
543,gt,road.


Document 2 in the same python list:



  contact
richard peterson
4563565
87,td street,Ny.


Here both the above text documents(1 and 2) are part of the same. In the text, I need to extract the 1st line after contact,i.e the name. However in Document-1 there are 2 names which I need to extract and just the names.



I need to write a regular expression to fetch the needful and the output should look like



     Names from Document 1 =[tomgonsalves,harrydsouza]
names from document-2=[richard peterson]


Can someone please help me with the reGex script in python 3 for this.There are many docs in the same list which may have different number of names in its contact. I just need to extract the names alone(be it 1 or multiple)







regex python-3.x






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 19 '18 at 15:48









sayan_sensayan_sen

217




217








  • 1





    Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.

    – Tim Biegeleisen
    Nov 19 '18 at 15:51














  • 1





    Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.

    – Tim Biegeleisen
    Nov 19 '18 at 15:51








1




1





Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.

– Tim Biegeleisen
Nov 19 '18 at 15:51





Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.

– Tim Biegeleisen
Nov 19 '18 at 15:51












2 Answers
2






active

oldest

votes


















1














You can try something like this:



Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.



In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared
...: for i in f:
...: i = re.sub('n','',i)
...: if i.isalpha():
...: if not i == 'contact':
...: print(i)

TomGonsalves
harrydsouza


You can do this for each file you have.






share|improve this answer


























  • @sayan_sen Let me know if it works for you.

    – Mayank Porwal
    Nov 20 '18 at 4:11



















0














Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,



(?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)


Here is a demo



Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.






share|improve this answer























    Your Answer






    StackExchange.ifUsing("editor", function () {
    StackExchange.using("externalEditor", function () {
    StackExchange.using("snippets", function () {
    StackExchange.snippets.init();
    });
    });
    }, "code-snippets");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "1"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53378216%2fextract-text-every-line-and-check-in-regular-expression%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    1














    You can try something like this:



    Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.



    In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared
    ...: for i in f:
    ...: i = re.sub('n','',i)
    ...: if i.isalpha():
    ...: if not i == 'contact':
    ...: print(i)

    TomGonsalves
    harrydsouza


    You can do this for each file you have.






    share|improve this answer


























    • @sayan_sen Let me know if it works for you.

      – Mayank Porwal
      Nov 20 '18 at 4:11
















    1














    You can try something like this:



    Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.



    In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared
    ...: for i in f:
    ...: i = re.sub('n','',i)
    ...: if i.isalpha():
    ...: if not i == 'contact':
    ...: print(i)

    TomGonsalves
    harrydsouza


    You can do this for each file you have.






    share|improve this answer


























    • @sayan_sen Let me know if it works for you.

      – Mayank Porwal
      Nov 20 '18 at 4:11














    1












    1








    1







    You can try something like this:



    Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.



    In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared
    ...: for i in f:
    ...: i = re.sub('n','',i)
    ...: if i.isalpha():
    ...: if not i == 'contact':
    ...: print(i)

    TomGonsalves
    harrydsouza


    You can do this for each file you have.






    share|improve this answer















    You can try something like this:



    Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.



    In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared
    ...: for i in f:
    ...: i = re.sub('n','',i)
    ...: if i.isalpha():
    ...: if not i == 'contact':
    ...: print(i)

    TomGonsalves
    harrydsouza


    You can do this for each file you have.







    share|improve this answer














    share|improve this answer



    share|improve this answer








    edited Nov 20 '18 at 4:12

























    answered Nov 19 '18 at 16:01









    Mayank PorwalMayank Porwal

    4,9202724




    4,9202724













    • @sayan_sen Let me know if it works for you.

      – Mayank Porwal
      Nov 20 '18 at 4:11



















    • @sayan_sen Let me know if it works for you.

      – Mayank Porwal
      Nov 20 '18 at 4:11

















    @sayan_sen Let me know if it works for you.

    – Mayank Porwal
    Nov 20 '18 at 4:11





    @sayan_sen Let me know if it works for you.

    – Mayank Porwal
    Nov 20 '18 at 4:11













    0














    Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,



    (?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)


    Here is a demo



    Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.






    share|improve this answer




























      0














      Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,



      (?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)


      Here is a demo



      Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.






      share|improve this answer


























        0












        0








        0







        Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,



        (?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)


        Here is a demo



        Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.






        share|improve this answer













        Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,



        (?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)


        Here is a demo



        Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 19 '18 at 16:13









        Pushpesh Kumar RajwanshiPushpesh Kumar Rajwanshi

        7,5932927




        7,5932927






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Stack Overflow!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53378216%2fextract-text-every-line-and-check-in-regular-expression%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Guess what letter conforming each word

            Run scheduled task as local user group (not BUILTIN)

            Port of Spain