You are here: PSPad forum > English discussion forum > How to extract all links from plain text?

How to extract all links from plain text?

Goto Page: 1 2 3 4 5 Next

#1 How to extract all links from plain text?

Posted by: maki | Date: 2020-01-07 01:39 | IP: IP Logged

How to extract all links from plain text?

Find -> CTRL + F -> Find a URL

[Enabled] URLs Can Include Full-Width Unicode Characters
[Enabled] URLs Can Include Half-Widht Unicode Characters

Recognize These Charcters:
Code:
!#$%&'()*+,-./0123456789:;=?@ABCDEFGHIJKLMNOPQRSTUVWXYZ\^_abcdefghijklmnopqrstuvwxyz~

Character Not Allowed at End:
Code:
!'(),.:;?whitespace

Options: Reply | Quote | Up ^


#2 Re: How to extract all links from plain text?

Posted by: pspad | Date: 2020-01-07 04:29 | IP: IP Logged

To extract URL, you can start with Find dialog and press [!] button
If regular expression doesn't fit, modify it as you need

Options: Reply | Quote | Up ^


#3 Re: How to extract all links from plain text?

Posted by: maki | Date: 2020-01-07 12:08 | IP: IP Logged

I want to add a regular expression to another editor. But doesn't work:

Regex Engine:
www.boost.org

^(?:[url]http://[/url])?([^/]+)(?:/?.*/?)/(.*)$
---------------------------
Error
---------------------------
The complexity of matching the regular expression exceeded predefined bounds. Try refactoring the regular expression to make each choice made by the state machine unambiguous. This exception is thrown to prevent "eternal" matches that take an indefinite period time to locate.
---------------------------

Edited 1 time(s). Last edit at 2020-01-07 12:13 by maki.

Options: Reply | Quote | Up ^


#4 Re: How to extract all links from plain text?

Posted by: pspad | Date: 2020-01-07 12:10 | IP: IP Logged

maki:
I want to add a regular expression to another editor. But doesn't work:

If you want to use "another editor" and it doesn't work, you need to request "another editor" support.

Options: Reply | Quote | Up ^


#5 Re: How to extract all links from plain text?

Posted by: maki | Date: 2020-01-07 12:18 | IP: IP Logged

Also not work PSPad;

imagesad smiley

Options: Reply | Quote | Up ^


#6 Re: How to extract all links from plain text?

Posted by: pspad | Date: 2020-01-07 12:57 | IP: IP Logged

I see. It's a quiz for me. I should guess what regular expression did you use.

Use this regular expression:
(news|http|ftp|https):\/\/[\w\-_]+(\.[\w]+)+([\w\-\.,@?^=%&:/~\+#]*[\w\-\@?^=%&/~\+#])?

Edited 1 time(s). Last edit at 2020-01-07 13:00 by pspad.

Options: Reply | Quote | Up ^


#7 Re: How to extract all links from plain text?

Posted by: pspad | Date: 2020-01-07 13:25 | IP: IP Logged

Please ensure you deleted content of find box before you click on regular expression example. The regular expression I put into last answer was the same what is inserted when you click on the Find URL item.

When you want to extract all URLs into new document, use Button COPY

Options: Reply | Quote | Up ^


#8 Re: How to extract all links from plain text?

Posted by: maki | Date: 2020-01-07 14:25 | IP: IP Logged

I have 100 TXT files and I want to extract all URLs from all text files.
You need to open 100 TABS sad smiley ??? and how to multi-search all tabs?

Some operations in PSPad are impossible to perform or are more complex than in another editor. That's why I talked about another editor that has easier tasks, but without regex it is impossi

Edited 1 time(s). Last edit at 2020-01-07 14:27 by maki.

Options: Reply | Quote | Up ^


#9 Re: How to extract all links from plain text?

Posted by: pspad | Date: 2020-01-07 17:50 | IP: IP Logged

You can make one file from all your files.
Use COPY command or use Insert file from PSPad functionality - you can insert all files into one. You asked me about it few days ago.

Options: Reply | Quote | Up ^


#10 Re: How to extract all links from plain text?

Posted by: maki | Date: 2020-01-07 20:02 | IP: IP Logged

Where are these options?

after closing the application.
Error sound without window with error information.
I don't want to use PSPad because the application still crashes.

Windowd Event Log:

Version=1
EventType=APPCRASH
EventTime=132229006452178576
ReportType=2
Consent=1
UploadTime=132229006452668604
ReportIdentifier=ec344f25-3187-11ea-9df1-0862662a261f
IntegratorReportIdentifier=ec344f24-3187-11ea-9df1-0862662a261f
Response.BucketId=3024011848
Response.BucketTable=496937667
Response.type=4
Sig[0].Name=Nazwa aplikacji
Sig[0].Value=PSPad.exe
Sig[1].Name=Wersja aplikacji
Sig[1].Value=5.0.3.360
Sig[2].Name=Sygnatura czasowa aplikacji
Sig[2].Value=5e12c0fd
Sig[3].Name=Nazwa modułu z błędem
Sig[3].Value=PSPad.exe
Sig[4].Name=Wersja modułu z błędem
Sig[4].Value=5.0.3.360
Sig[5].Name=Sygnatura czasowa modułu z błędem
Sig[5].Value=5e12c0fd
Sig[6].Name=Kod wyjątku
Sig[6].Value=c0000005
Sig[7].Name=Przesunięcie wyjątku
Sig[7].Value=00000000003877f0
DynamicSig[1].Name=Wersja systemu operacyjnego
DynamicSig[1].Value=6.1.7601.2.1.0.256.1
DynamicSig[2].Name=Identyfikator ustawień regionalnych
DynamicSig[2].Value=1045
DynamicSig[22].Name=Dodatkowe informacje 1
DynamicSig[22].Value=2881
DynamicSig[23].Name=Dodatkowe informacje 2
DynamicSig[23].Value=2881bb527cca8a9cfb7040aa19d86956
DynamicSig[24].Name=Dodatkowe informacje 3
DynamicSig[24].Value=e88d
DynamicSig[25].Name=Dodatkowe informacje 4
DynamicSig[25].Value=e88d8c5e5e1c485bf89667ee909d0959
UI[2]=C:\Program Files (x86)\PSPad editor\PSPad.exe
UI[3]=Program PSPad przestał działać.
UI[4]=System Windows może wyszukać rozwiązanie tego problemu w trybie online.
UI[5]=Wyszukaj rozwiązanie w trybie online i zamknij program
UI[6]=Wyszukaj rozwiązanie w trybie online później i zamknij program
UI[7]=Zamknij program
LoadedModule[0]=C:\Program Files (x86)\PSPad editor\PSPad.exe
LoadedModule[1]=C:\Windows\SYSTEM32\ntdll.dll
LoadedModule[2]=C:\Windows\system32\kernel32.dll
LoadedModule[3]=C:\Windows\system32\KERNELBASE.dll
LoadedModule[4]=C:\Windows\system32\winspool.drv
LoadedModule[5]=C:\Windows\system32\msvcrt.dll
LoadedModule[6]=C:\Windows\system32\GDI32.dll
LoadedModule[7]=C:\Windows\system32\USER32.dll
LoadedModule[8]=C:\Windows\system32\LPK.dll
LoadedModule[9]=C:\Windows\system32\USP10.dll
LoadedModule[10]=C:\Windows\system32\comdlg32.dll
LoadedModule[11]=C:\Windows\system32\SHLWAPI.dll
LoadedModule[12]=C:\Windows\WinSxS\amd64_microsoft.windows.common-controls_6595b64144ccf1df_6.0.7601.24483_none_e372d88f30fbb845\COMCTL32.dll
LoadedModule[13]=C:\Windows\system32\SHELL32.dll
LoadedModule[14]=C:\Windows\system32\version.dll
LoadedModule[15]=C:\Windows\system32\oledlg.dll
LoadedModule[16]=C:\Windows\system32\ADVAPI32.dll
LoadedModule[17]=C:\Windows\SYSTEM32\sechost.dll
LoadedModule[18]=C:\Windows\system32\RPCRT4.dll
LoadedModule[19]=C:\Windows\system32\ole32.dll
LoadedModule[20]=C:\Windows\system32\oleaut32.dll
LoadedModule[21]=C:\Windows\system32\netapi32.dll
LoadedModule[22]=C:\Windows\system32\netutils.dll
LoadedModule[23]=C:\Windows\system32\srvcli.dll
LoadedModule[24]=C:\Windows\system32\wkscli.dll
LoadedModule[25]=C:\Windows\system32\wsock32.dll
LoadedModule[26]=C:\Windows\system32\WS2_32.dll
LoadedModule[27]=C:\Windows\system32\NSI.dll
LoadedModule[28]=C:\Windows\system32\IMM32.DLL
LoadedModule[29]=C:\Windows\system32\MSCTF.dll
LoadedModule[30]=C:\Windows\system32\uxtheme.dll
LoadedModule[31]=C:\Windows\system32\CRYPTBASE.dll
LoadedModule[32]=C:\Program Files (x86)\Ad Muncher\AM64-34121.dll
LoadedModule[33]=C:\Windows\system32\dwmapi.dll
LoadedModule[34]=C:\Windows\system32\wtsapi32.dll
LoadedModule[35]=C:\Windows\system32\WINSTA.dll
LoadedModule[36]=C:\Windows\system32\RICHED20.DLL
LoadedModule[37]=C:\Windows\system32\profapi.dll
LoadedModule[38]=C:\Windows\system32\SETUPAPI.dll
LoadedModule[39]=C:\Windows\system32\CFGMGR32.dll
LoadedModule[40]=C:\Windows\system32\DEVOBJ.dll
LoadedModule[41]=C:\Windows\system32\CLBCatQ.DLL
LoadedModule[42]=C:\Windows\system32\propsys.dll
LoadedModule[43]=C:\Windows\system32\ntmarta.dll
LoadedModule[44]=C:\Windows\system32\WLDAP32.dll
LoadedModule[45]=C:\Windows\system32\WindowsCodecs.dll
LoadedModule[46]=C:\Windows\system32\apphelp.dll
LoadedModule[47]=C:\Windows\system32\EhStorShell.dll
LoadedModule[48]=C:\Windows\System32\cscui.dll
LoadedModule[49]=C:\Windows\System32\CSCDLL.dll
LoadedModule[50]=C:\Windows\system32\CSCAPI.dll
LoadedModule[51]=C:\Windows\system32\ntshrui.dll
LoadedModule[52]=C:\Windows\system32\slc.dll
LoadedModule[53]=C:\Windows\system32\Secur32.dll
LoadedModule[54]=C:\Windows\system32\SSPICLI.DLL
LoadedModule[55]=C:\Windows\system32\samcli.dll
LoadedModule[56]=C:\Windows\system32\SAMLIB.dll
LoadedModule[57]=C:\Windows\System32\shdocvw.dll
LoadedModule[58]=C:\Windows\system32\MPR.dll
LoadedModule[59]=C:\Windows\System32\drprov.dll
LoadedModule[60]=C:\Windows\System32\ntlanman.dll
LoadedModule[61]=C:\Windows\System32\davclnt.dll
LoadedModule[62]=C:\Windows\System32\DAVHLPR.dll
LoadedModule[63]=C:\Windows\system32\wpdshext.dll
LoadedModule[64]=C:\Windows\system32\WINMM.dll
LoadedModule[65]=C:\Windows\WinSxS\amd64_microsoft.windows.gdiplus_6595b64144ccf1df_1.1.7601.24494_none_145eb4f68b8d650b\gdiplus.dll
LoadedModule[66]=C:\Windows\system32\PortableDeviceApi.dll
LoadedModule[67]=C:\Windows\system32\WINTRUST.dll
LoadedModule[68]=C:\Windows\system32\CRYPT32.dll
LoadedModule[69]=C:\Windows\system32\MSASN1.dll
LoadedModule[70]=C:\Windows\system32\PortableDeviceTypes.dll
LoadedModule[71]=C:\Windows\system32\EhStorAPI.dll
LoadedModule[72]=C:\Windows\system32\NetworkExplorer.dll
LoadedModule[73]=C:\Program Files\Common Files\microsoft shared\ink\tiptsf.dll
LoadedModule[74]=C:\Windows\system32\explorerframe.dll
LoadedModule[75]=C:\Windows\system32\DUser.dll
LoadedModule[76]=C:\Windows\system32\DUI70.dll
LoadedModule[77]=C:\Windows\system32\msls31.dll
LoadedModule[78]=C:\Windows\system32\xmllite.dll
LoadedModule[79]=C:\Windows\system32\CRYPTSP.dll
LoadedModule[80]=C:\Windows\system32\rsaenh.dll
LoadedModule[81]=C:\Windows\system32\RpcRtRemote.dll
LoadedModule[82]=C:\Windows\System32\StructuredQuery.dll
LoadedModule[83]=C:\Windows\System32\actxprxy.dll
LoadedModule[84]=C:\Program Files\internet explorer\ieproxy.dll
LoadedModule[85]=C:\Windows\system32\api-ms-win-downlevel-shlwapi-l1-1-0.dll
LoadedModule[86]=C:\Windows\system32\api-ms-win-downlevel-shlwapi-l2-1-0.dll
LoadedModule[87]=C:\Windows\system32\api-ms-win-downlevel-advapi32-l1-1-0.dll
LoadedModule[88]=C:\Windows\system32\thumbcache.dll
LoadedModule[89]=C:\Windows\system32\PSAPI.DLL
LoadedModule[90]=C:\Windows\system32\ieframe.DLL
LoadedModule[91]=C:\Windows\system32\api-ms-win-downlevel-user32-l1-1-0.dll
LoadedModule[92]=C:\Windows\system32\api-ms-win-downlevel-shell32-l1-1-0.dll
LoadedModule[93]=C:\Windows\system32\api-ms-win-downlevel-version-l1-1-0.dll
LoadedModule[94]=C:\Windows\system32\api-ms-win-downlevel-normaliz-l1-1-0.dll
LoadedModule[95]=C:\Windows\system32\normaliz.DLL
LoadedModule[96]=C:\Windows\system32\iertutil.dll
LoadedModule[97]=C:\Windows\system32\WINHTTP.dll
LoadedModule[98]=C:\Windows\system32\webio.dll
LoadedModule[99]=C:\Windows\system32\SearchFolder.dll
LoadedModule[100]=C:\Windows\system32\LINKINFO.dll
LoadedModule[101]=C:\Windows\system32\zipfldr.dll
State[0].Key=Transport.DoneStage1
State[0].Value=1
State[1].Key=DataRequest
State[1].Value=Bucket=-1270955448/nBucketTable=496937667/nResponse=1/n
FriendlyEventName=Zatrzymano działanie.
ConsentKey=APPCRASH
AppName=PSPad
AppPath=C:\Program Files (x86)\PSPad editor\PSPad.exe

Edited 1 time(s). Last edit at 2020-01-07 20:03 by maki.

Options: Reply | Quote | Up ^


Goto Page: 1 2 3 4 5 Next





Editor PSPad - freeware editor, © 2001 - 2024 Jan Fiala, Hosted by Webhosting TOJEONO.CZ, design by WebDesign PAY & SOFT, code Petr Dvořák, Privacy policy and GDPR