πΊ screenpipe #006 | search interface, 600 mb 10% CPU, window filtering
screenpipe #006
β¬οΈ download mac m app
β¬οΈ download windows app
give us a βοΈ(11.5k) to help the algorithm
hey louis & matt here
btw, in this newsletter, we share new features, bug fixes, work in progress, and what's next.
as a reminder, screenpipe is an open source library and an app to record data from your screens & mics and pass it into a llm. an ai powered by what you've seen, said, or heard:
β¨ what's new
π search
you can now search your desktop history the old-school wayβby keywords, dates, apps, or even that random window you accidentally closed
it's basic for now and still a work in progress. at the same time, we're going to make the chat more intuitive and require less prompt engineering in the future
πͺ window, app, tab filtering
record specific windows, apps, tabs, or filter them out
for example:
- filter out all windows that contain ".env" in the title, good for devs
- ignore all windows that contain "bit". it will ignore all windows and tabs containing "bit" like "bitwarden", "bittorent", etc.
- or select "chrome" if you want to record only google chrome
π new pipes (plugins)
- security pipe that warns user about potentially dangerous actions, e.g. phishing e-mails (demo)
- put ai guardrails around humans: llama 3.1 blocking user keyboard and mouse when it identifies unwanted behavior, proof-of-concept (demo)
βοΈ other new features
- send desktop notifications from screenpipe at your choice
- macos: no permission issues anymore, smooth installation experience
- screenpipe plugins: pipes, pass configs, write typescript
- multi-monitor recording & ocr now in settings in ui and cli
- new voice detection implementation with silero vad
- no windows defender alerts
- linux wayland fix (wip)
- known issues: sometimes macos audio output crashes, work around is to create a virtual audio device that contains the system audio or uses restart interval settings which forces a restart of the process
π next
- use case: automatic summaries for audio/text conversation by speaker
- use case: highlight grammar suggestions on the screen in real-time
- more examples of preventive actions by llm, notifications + taking over control
- more advanced desktop control in pipes, more examples
- installation, stability & performance cross-platform, fix known issues
- search: more relevancy, less redundant information
- new models: brand new local speech to text model - silero
- new models: add new ocr engine (esp. for linux)
- security: mp4 encryption at rest
- storage: reduce storage required by half using h265 encoding
- ux: automatic audio device switch
- ux: more reliable interface, specific interface for audio summaries, global shortcut, cursor-alike chat, shortcuts, etc.
- ux: more customizable app ai settings (use your own openai api compatible URL)
- extensions: plenty of plugins you can install in a click (or build yourself) to get the most out of your data
- extensions: use native api (control your computer, display on the screen, etc.), real-time data streaming, high level abstraction in pipes
btw we're running a bunch of paid bounties (up to $200 atm) to make screenpipe even better, just go to github issues and check issues with label "bounty"
- the app is still in alpha and we've fixed tons of bugs, however, we're releasing daily updates to fix them, along with new features. we're a two-person team, but we have open source contributors joining, and we would be happy to welcome more! βΊοΈπ
links
take care,
screenpipe
wanna chat?
You are receiving this email because you opted-in to receive updates from Mediar, Inc
Mediar, Inc, 2 Marina Blvd B300, San Francisco, CA 94123