-  [WT]  [PS]  [Home] [Manage]

  1.   (new thread)
  2. [ No File]
  3. (for post and file deletion)
/pr/ - Programming
  • Supported file types are: C, CSS, DOC, DOCX, GIF, H, JAVA, JPG, PDF, PNG, SVG, SWF, TXT, WEBM
  • Maximum file size allowed is 10000 KB.
  • Images greater than 200x200 pixels will be thumbnailed.
  • Currently 419 unique user posts. View catalog

  • Blotter updated: 2018-08-24 Show/Hide Show All

We are in the process of fixing long-standing bugs with the thread reader. This will probably cause more bugs for a short period of time. Buckle up.

Movies & TV 24/7 via Channel7: Web Player, .m3u file. Music via Radio7: Web Player, .m3u file.

WebM is now available sitewide! Please check this thread for more info.

+--------+ +------+ 21/12/21(Tue)08:52 No. 5532 [Reply]
5532

File 164007316957.png - (356.68KB , 720x378 , QuickScreencapture_20211220-235228.png )

+------+




10 yr Enterprise Veteran Neckbearded Basement Dweller 19/09/06(Fri)23:50 No. 5299 [Reply]
5299

File 15678066177.jpg - (68.73KB , 400x271 , memer.jpg )

got any questions for me?

10 years in healthcare and finance, 12 corporations, all bullshit.

last pos. some spider experts out of TX told me someone else is going to be taking credit for my work from now on, so I told them all to gtfh.

I quit... Then they tried to destroy my career... But I'm still going strong.

Anyone want to know how broken Enterprise software is..?

I've been in environments w/ 12,000+ stored procedures, 6,000+ data tables, 4,000+ user defined functions, and 8,000+ database triggers.


5 posts omitted. Click Reply to view.
>>
Neckbearded Basement Dweller 21/07/10(Sat)18:29 No. 5475

>10 years in healthcare and finance, 12 corporations

What initiatives did you spearhead during that time?


>>
Neckbearded Basement Dweller 21/11/15(Mon)22:13 No. 5504

I think you can start from python. Nowadays very popular became video content. I used help of https://darvideo.tv/ during making promotion campaign of some applications. Most marketers feel that video is a great investment for lead generation. 84% of video marketers say video has been effective for generating leads, up 1% from 2020.


>>
Neckbearded Basement Dweller 21/12/01(Wed)11:29 No. 5513

You have huge experience. Thats great. I am still beginner and study different programming languages. You know that C was originally designed as a system implementation language within Bell Labs and has since become one of the most popular programming languages in existence. It's been used for developing operating systems, compilers, debuggers, and many other applications that involve low-level computer hardware interaction. I started to study this prograaming recently on courses. But nowadays I apply to https://www.aimprosoft.com/blog/how-to-create-mobile-ecommerce-app-costs-tech-stack-and-market-strategy/ becasue want to develop own second ecommerce store.




+------+ +--------+ 21/12/19(Sun)02:05 No. 5526 [Reply]
5526

File 163987591666.png - (253.57KB , 479x398 , QuickScreencapture_20211218-155354.png )

...


>>
Neckbearded Basement Dweller 21/12/19(Sun)09:49 No. 5530

In our time we need to check more attention onto logo and design of own websites. A good logo is simple, distinctive, and reflective of your brand values. It should be practical to use and feature a singular design. You can find more tips about logo and design on https://www.namecheap.com/guru-guides/logo-design-tips/




Using text recognition software on website archives? Neckbearded Basement Dweller 20/12/05(Sat)17:31 No. 5435 [Reply]
5435

File 160718589761.jpg - (41.84KB , 640x480 , 1607051047913.jpg )

Null of Kiwi Farms recently stated that he'll release archive of the site minus user data when section 230 is repealed, in torrent form. Does anyone have any advice on automating text recognition software on that archive so I can build a database of hashes for comparison to other sites?

I'd love nothing more than for every Kiwi Farms user to be doxed and sued into oblivion.


9 posts omitted. Click Reply to view.
>>
Jones 21/11/11(Thu)09:16 No. 5502

When I need any software, I ask for help from specialists, since I do not have time to understand all this. If you are interested, then on the https://8allocate.com/dedicated-teams/, you will be able to read more detailed information. Maybe you can find help here to build the software you need. Have a nice day.


>>
Neckbearded Basement Dweller 21/11/17(Wed)11:45 No. 5505

I like cold coffee any time of the year


>>
Neckbearded Basement Dweller 21/12/10(Fri)10:19 No. 5523

>>5435
>Can't tell which end of the bell curve you're on, but it's not the middle
Typically, hashes are unique for any input. In fact, they are designed for it to be statistically impossible to find 2 things with the same hash. Hardly useful for identifying the same author of different texts. However, there is a concept of locality-sensitive hashing which might be useful or unnecessary for you.

What you want is some machine learning model that takes in text and spits out a lower-dimensional vector describing the text (analogous to a convolutional net for images). Then, you can apply locality-sensitive hashing to collapse similar vectors to vectors you use as identities. For outliers, you would even be able to say % chance of each identity by calculating a simple projection in each.

However, if you do the machine learning properly, you won't need hashing; it's just a way to calibrate the model after the fact if you fuck up.

Now, for how to create and train the model, the first step is data. Ideally, you would have a large, diverse corpus of text labelled by author. If not, this is easy enough to create by scraping the web.

This problem is very similar to facial identification (different from facial recognition, which would be like a program that decides whether text was produced randomly or intelligently, not who produced it) in that both are solved by transforming the data into a vector of features describing the data which can then be compared to each other in the resulting metric space, and that both fundamentally deal with identity. Ie, the principle component of the facial metric space probably corresponds to gender. By comparing how white, how black, how asian, how fat, how thin, how masculine, how feminine, etc, the faces in multiple pictures are, you can tell which ones are probably of the same person. It is the same process for identifying the author of a text. The text has analogous features like syntax, verbosity, vocabulary, tone, etc, when taken together, can identify an author.

Anyway, set up a model that has an appropriate input size for your data and and your best guess at how many features you need for output (you will tweak this until you stop seeing improvement). Cost function should be (euclidian proportional, but more computationally efficient) distance of output vector from average of outputs for the same author minus the sum of the distances from each of the other authors' averages, normalized. If it doesn't work after completing training, mess with the output size and try again. Once it's as good as it'll get, optionally apply locality-sensitive hashing for maximum effort.






+------+ 21/12/03(Fri)07:45 No. 5520 [Reply]
5520

File 163851394196.png - (75.84KB , 274x181 , QuickScreencapture_20211202-224345.png )








+------+ 21/12/03(Fri)07:18 No. 5517 [Reply]
5517

File BuildCharacter.txt - (107.21KB )




+------+ 21/12/03(Fri)07:17 No. 5516 [Reply]
5516

File choicescript_stats.txt - (80.85KB )





Delete post []
Password  
Report post
Reason